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98 Human Secreted Proteins 
Field of the Invention 
This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
organelle, contains different proteins essential for the function of the organelle. The " 
cell uses "sorting signals," which are amino acid motifs located within the protein, to 
target proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a 
leader sequence, directs a class of proteins to an organelle called the endoplasmic 
reticulum (ER). The ER separates the membrane-bounded proteins from all other 
types of proteins. Once localized to the ER, both groups of proteins can be further 
directed to another organelle called the Goigi apparatus. Here, the Golgi distributes 
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomcs, 
and the other organelles. 

Proteins targeted to the ER by a signal sequence can be released into the 
extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the 
extracellular space - a process called exocytosis. Exocytosis can occur constitutively 
or after receipt of a triggering signal. In the latter case, the proteins are stored in 
secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly, 
proteins residing on the cell membrane can also be secreted into the extracellular 
space by proteolytic cleavage of a "linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins 
include the commercially valuable human insulin, interferon, Factor VIII, human 
growth hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of 
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the pervasive role of secreted proteins in human physiology, a need exists for 
identifying and characterizing novel human secreted proteins and the genes that 
encode them. This knowledge will allow one to detect, to treat, and to prevent 
medical disorders by using secreted proteins or the genes that encode them. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, 
antibodies, and recombinjnt methods for producing the polypeptides and 
polynucleotides. Also provided are diagnostic methods for detecting disorders related 
to the polypeptides, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for identifying binding partners of the 
polypeptides. 

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated'* because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable 
of being directed to the ER, secretory vesicles, or the extracellular space as a result of 
a signal sequence, as well as those proteins released into the extracellular space 
without necessarily containing a signal sequence. If the secreted protein is released 
into the extracellular space, the secreted protein can undergo extracellular processing 
to produce a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 
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In specific embodiments, the polynucleotides of the invention are less than 
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further 
10 embodiment, polynucleotides of the invention comprise at least 15 contiguous 

nucleotides of the coding sequence, but do not comprise all or a portion of any iniron. 
5 In another embodiment, the nucleic acid comprising the coding sequence does not 
contain coding sequences of a genomic flanking gene (i.e., 5* or 3' to the gene in the 
15 genome). 

As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within ihe clone 
10 deposited with the AT CC For example, the polynucleotide can contain the 
nucleotide sequence of the full length cDN A sequence, including the 5' and 3' 
untranslated sequences, the coding region, with or without the signal sequence, the 
secreted protein coding region, as well as fragments, epitopes, domains, and variants 
25 of the nuclcic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 

1 5 molecule having the translated amino acid sequence generated from the 
polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ID NO:X 
was often generated by overlapping sequences contained in multiple clones (contig 
analysis). A representative clone containing all or most of the sequence for SEQ ID 
20 NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
35 sbown in Table 1 - cach clone is identified by a cDNA Clone ID (Identifier) and the 

ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the international recognition of the deposit of 
40 25 microorganisms for purposes of patent procedure. 

A "polynucleotide" of the present invention also includes those 
polynucleotides capable of hybridizing, under stringent hybridization conditions, to 
sequences contained in SEQ ID NO:X, the complement thereof, or thecDNA within 
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an 
30 overnight incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 
mM NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6). 5x Denhardt's 
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solution, 10% dextran sulfate, and 20 |ig/ml denatured, sheared saimon sperm DNA, 
followed by washing the filters in O.lx SSC at about 65°C 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of forraamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a sojution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M 
NaH 2 PO d ; 0.02M EDTA, pH 7.4). 0.5% SDS. 30% formamide, 1 00 ug/mi salmon - 
sperm blocking DNA; followed by washes at 50°C with 1XSSPE, 0. 1 % SDS. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardfs reagent, BLOTTO, heparin, denatured salmon sperm DNA. and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide;' since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically 
any double-stranded cDNA clone). 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may he unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
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that may be single-stranded or, more typically, double-stranded or a mixture of single- 
and double-stranded regions. In addition, the polynucleotide can be composed of 
10 triple-stranded regions comprising RNA or DNA or both RNA and DMA. A 

polynucleotide may also contain one or more modified bases or DNA or RNA 
5 backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of 
modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolicaliy modified forms. 

The polypeptide o^the present invention can be composed of amino acids 
10 joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 

isosteres. and may contain amino acids other than the 20 gene-encoded amino acids,' 
The polypeptides may be modified by either natural processes, such as 
posttranslational processing, or by chemical modification techniques which are well 
25 known in the art. Such modifications are well described in basic texts and in more 

15 detailed monographs, as well as in a voluminous research literature. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid 
3Q side-chains and the amino or carboxyl termini, ft will be appreciated that the same 

type of modification may be present in the same or varying degrees at several sites in 
a given polypeptide. Also, a given polypeptide may contain many types of 
20 modifications. Polypeptides may be branched , for example, as a result of 
35 ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, 

and branched cyclic polypeptides may result from posttranslation natural processes or 
may be made by synthetic methods. Modifications include acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 
25 heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 
cross-linking, cyclization, disulfide bond formation, demethylation, formation of 
covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, 
30 iodination, methylation, myristoyiation, oxidation, pegylation, proteolytic processing, 
phosphorylation, prcnylation, racemization, selenoylation, sulfation. transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
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(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed, Academic Press. New York. pgs. 1-12 (1983); Seifter et al., Meth 
Enzymol 182:626-646 (1990); Rattan el aJ., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table I. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the. 
present invention, including mature forms, as measured in a particular biological ' 
assay, with or without dose dependency. In the case where dose dependency docs 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 
more than about 25 -fold less and, preferably, not more than about tenfold less 
activity, and most preferably, not more than about three-fold less activity relative to 
the polypeptide of the present invention.) 

Polynucleotides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

The translation product of this gene is a human glycoprotein-associated amino 
acid transporter (See, e.g., Genbank Accession No. emblCAAl0198.1i (AJ130718); 
all references available through this accession are hereby incorporated by reference 
herein). Amino acid transport across cellular membranes is mediated by multiple 
transporters with overlapping specificities. The transport system L, which mediates 
Na+-indcpcndent exchange of large neutral amino acids, consists of a novel amino 
acid permease-related protein (LAT1 or AmAT-L-lc) which for surface expression 
and function requires formation of disulfide-linked heterodimers with the 
glycosylated heavy chain of the h4F2/CD98 surface antigen. h4F2hc also associates 
with other mammalian light chains, e.g. y+LATl from mouse and human which are 
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approximately 48% identical with LAT1 and thus belong to the same family of 
glycoprotein-associated amino acid transporters. 

The novel heterodimers form exchangers which mediate the cellular efflux of 
cationic amino acids and the Na+-dependent uptake of large neutral amino acids. 
T.iese transport characteristics and kinetic and pharmacological fingerprints identify 
them as y+L-typc transport systems. mRNA encoding my+LATl is detectable in most 
adult tissues and expressed at high levels in kidney cortex and intestine. This indicates 
that the y+LATl-4F2hc heterodimer, besides participating in amino acid 
uptake/secretion in many cell types, is the basolatcral amino acid exchanger involved 
in transepitheiial reabsorption of cationic amino acids; hence, its defect might be the 
cause of the human genetic disease lysinuric protein intolerance. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
14. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 14. 

Preferred polypeptides comprise the following amino acid sequence: 
LALYSALFS YSGWDTLN (SEQ ID NO: 237). VTEEIKNPERNLPL (SEQ ID NO: 
238), IGISMPIVT <SEQ ED NO: 239), IYILTNVAYYTVL (SEQ ID NO: 240), SDA 
VAVTFADQ (SEQ rD NO: 241), VALSCFGGLNASI (SEQ ID NO: 242), SRLFFV 
GSREGHLPD (SEQ ID NO: 243), SFSYWFFVGUS (SEQ ID NO: 244), VGQLYLR 
WKEP (SEQ ID NO: 245), RPRPLKLS VFFPIVFC (SEQ ID NO: 246), DTINSLIG1 
(SEQ ID NO: 247), LLAAACICLLTFINCAYVKWGTLVQDIFTYAKVLALIAVI 
V AG WRLGQGASTHFENSraGSSFAVGDIALALYSALFS YSGWDTLN YVTEEI 
KNPERNLPLSIGISMPrVTIIYILTNVAYYTVLDMRDILASDAVAVTFADQI 

NWUPLSVALSCFGGLNASIVAASRLFFVGSREGHLPDAICMIHVERFTPVPSLL 
FNGlMALIYLCVEDffQLINYYSFSYWFFVGL^ 

VF^PIVFCLCTIPLVAVPLYSDriNSLlGIAIALSGLPFYFLURVPEHKRPLYLRRI 
VGSATRYLQVLCMSVAAEMDLEDGGEMPKQRDPKSN (SEQ ID NO: 249) 
and/or ATALPPKIVGSATRYLQVLCMSVAAEMDLEDGGEMPKQRDPKSN 
(SEQ ID NO: 248). Polynucleotides encoding these polypeptides are also provided. 

Contact of ceils with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-l monocyte cells 
to calcium. Thus, it is likely that the product of this gene is involved in a signal 
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transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both THP- 1 monocytes, in addition to other cell-lines or 
tissue cell types. Thus, polynucleotides and polypeptides have uses which include, but 
are not limited to, activating monocytes. 

This gene is expressed primarily in endothelial cells and brain, and, to a lesser 
extent, in a wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the neural or gastrointesinal systems. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing ' 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the circulation 
system or central nervous system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, 
gastrointestinal, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e.. the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 124 as residues: Glu-102 to Asn-i 10 t Arg-256 to 
Leu-266, Pro-3 16 to Trp-328, Pro-33 1 to Arg-336. Met-350 to Gty-358. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in brain combined with its homology to a amino acid 
transporter and biological activity of increasing ion flux in monocytes indicates 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions. Representative uses are described in the 
"Regeneration" and "Hyperproliferative Disorders" sections below, in Example 1 1, 
15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
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Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, dcmyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 550 of SEQ ID NO: 1 1 , b is an 
integer of 15 to 1564, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:l 1, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

The gene encoding the disclosed cDNA is believed to reside on the X 

chromosome. Accordingly, polynucleotides related to this invention are useful as a. 

marker in linkage analysis for the X chromosome. 

Preferred polypeptides of the invention comprise the following amino acid 

sequence: 
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AARGSGVRDPLEEAVCPFSDLQLHAGRTTALFKAVRQGHLSLQRLLLSFVCL 
CPAPRGGAYRGRQASLSCGGLHPVRASRLLCLPKQAWAMAGAPPPVSLPPCS 
LISDCCASNQRDS VG (SEQ ID NO: 250). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in cord blood cells, and, to a lesser extent, in 
frontal lobe of the brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental, reproductive, hematopoietic or neural disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in' 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and central nervous systems, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., immune, 
developmental, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
amniotic fluid, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 125 as residues: His-56 to Gln-65. Leu-80to lle-85. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in cord blood cells indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia since stromal cells are important in the production of 
cells of hematopoietic lineages. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections beiow, in Example II, 13. 14, 16, 18, 19, 
20. and 27, and elsewhere herein. Briefly, the uses include bone marrow cell ex -vivo 
culture, bone marrow transplantation, bone marrow reconstitute, radiotherapy or 
chemotherapy of neoplasia. 
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The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. Alternatively, 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of cancer and other proliferative disorders. Expression within 
embryonic tissue and other cellular sources marked by proliferating cells indicates 
that this protein may play ajole in the regulation of cellular division. Similarly, 
embryonic development also involves decisions involving cell differentiation and/or 
apoptosis in pattern formation. Thus, this protein may also be involved in apoptosis or 
tissue differentiation and could again be useful in cancer therapy. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as. 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 12 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1743 of SEQ ID NO:12, b is an 
integer of 15 to 1757, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 12, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

This gene is expressed primarily in human T cell lymphomas. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biologicai sample and for diagnosis of diseases and conditions which include, but are 
not limited to, T cell lymphoma, immunodeficiencies, in addition to other immune 
system disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identificauon of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an ' 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 126 as residues: Met-1 to Phe-10. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in human T cell lymphomas indicates polynucleotides 
and polypeptides corresponding to this gene are useful for the diagnosis and treatment 
of a variety of immune system disorders. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11, 13, 14, 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g., by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
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disorders, such as autoimmune infertility, (ense tissue injury, demyelination, systemic 
lupus erythernatosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Furthermore, the protein may also be used to determine biological activity, 
raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 13 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-K where a is any integer between 1 to 1 359 of SEQ ID NO: 13, b is an 
integer of 15 to 1373, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 13, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

The protein product of this clone shares sequence homology with the C- 
teminus of a human N-acetylglucosamine-phosphate mutase (See, e.g.. Genbank 
Accession No. gbIA AC72409.il (AF102265); all references available through this 
accession are hereby incorporated by reference herein.) Hofmann, eL al. (Eur. J. 
Biochem. 221:741-747 (1994)) studied the N-acetylglucosamine-phosphate mutase of 
Saccharomyces cerevisiae and showed it to be essential for viability. A S. cerevisiae 
agml deletion mutant progressed through only approximately five cell cycles to form 
a string" of undivided cells with an abnormal cell morphology resembling 
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glucosamine auxotrophic mutants. Expression of the AGM1 gene on a multi-copy 
plasmid led to a significantly increased N-acetylglucosamine-phosphate mutase 
activity. Unlike over-expression of the S. cerevisiae AGMl gene in a 
phosphoglucomutase (pgml delta/pgm2 delta) double deletion mutant which could 
restore phosphoglucomutase activity, over-expression of the PGM2 gene encoding the 
major isoenzyme of phosphoglucomutase did not increase N-acetylgiucosamine- 
phosphate-mutase activity and did not restore growth of agml deletion mutant cells. 
These observations indicate that the different hexosephosphate mutases of S. 
cerevisiae have partially overlapping substrate specificities but, nevertheless, distinct 
physiological functions. The human N-acetyiglucosamine-phosphate-mutase is 
expected to share at least some biological activities with the Agm 1 protein. 

Preferred polypeptide fragments of the invention comprise the following 
amino acid sequences: LSKAFLDSPNRLLAVEMr^DHLRLTVPNGIGALKLRXM 
EHYFSQGLSVQLFNDGSKGKJ-NHLCGADFVKSHQICPPQGMEIKSNERCCSFD 
GDADRTVYYYHDADGHFHLIDGDKIATLISSFLICELLVEIGESLNIGVVQTAYA 
NGSSTRYLEEVMKVPVYCTKTGVKHLHHKAQEFDIGVYFEANGHGTALFST 
AVEMKIKQSAEQLEDKKRKAAKMLENIIDLFNQAAGDAISDMLVIEAILALK 
GLWQQWDALYTDLPNRQLKVQVADRRVISTTXAERQAVTPPGLQEATNDL 
VKKYKLSRAFVRPSGTEDVVRVYAEADSQESADHLAHEVSLAVFQLAGGfGE 
RPQPGF (SEQ ID NO: 250, LS K AFLDSPNRLL A VEMNTDHLRLT V (SEQ ID 
NO: 252), PNGIGALKLRXMEHYFSQGLSVQLFNDG (SEQ ID NO: 253), SKGKL 
NHLCGADFVKSHQKPPQGMEIKS (SEQ ID NO: 254), NERCCSFDGDADRIV 
Y YYHDA DGHFHLI (SEQ ID NO: 255), DGDKIATLISSFLKELLVEIGESLNIGV 
(SEQ ID NO: 256), VQTAYANGSSTRYLEEVMKVPVYCTKTG (SEQ ID NO: 
257), VKHLHHKAQEFDIGVYFEANGHGTALFS (SEQ ID NO: 258), TAVEMK 
IKQSAEQLEDKFCRKAAKMLENI (SEQ ID NO: 259). IDLFNQAAGDAISDM 
LVIEAILALKGLT (SEQ ID NO: 260), VQQWDALYTDLPNRQLKVQVADRR 
VIST (SEQ ID NO: 261), TXAERQAVTPPGLQEAINDLVKKYKLSR (SEQ ID 
NO: 262), AFVRPSGTEDVVRVYAEADSQESA (SEQ ID NO: 263), and/or DH 
LAHEVSLAVFQLAGGIGERPQPGF (SEQ ID NO: 264). Polynucleotides encoding 
these polypeptides are also provided. 
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The gene encoding the disclosed cDNA is believed to reside on chromosome 
6. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 6. 

This gene is expressed primarily in fetal brain, and, to a lesser extent, in a 
wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental disorders, particularly of the central nervous system. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell * 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central and peripheral nervous system, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues or cell types (e.g., neural, or 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 127 as residues: Asn-36 to Lys-42, Lys-53 to OIn-60, 
Ile-64 to Ala-77, Ala- 128 to Tyr-135, Lys-184 to Ala- 199, Leu-245 to Leu-250. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution of N-acetylglucosarnine-phosphate mutase in fetal 
brain indicates polynucleotides and polypeptides corresponding to this gene are useful 
for the detection, treatment, and/or prevention of neurodegenerative disease states, 
behavioral disorders, or inflammatory conditions. Representative uses are described in 
the "Regeneration" and "Hypcrproliferative Disorders" sections below, in Example 
1 1, 15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to 
the detection, treatment, and/or prevention of Alzheimer's Disease. Parkinson's 
Disease, Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, 
demyelinating diseases, peripheral neuropathies, neoplasia, trauma, congenital 
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malformations, spinal cord injuries, ischemia and infarction, aneurysms, hemorrhages, 
schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder, depression, 
panic disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors, 
including disorders in feeding, sleep patterns, balance, and perception. In addition, 
elevated expression of this gene product in regions of the brain indicates it plays a 
role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional ' 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 14 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 3726 of SEQ ID NO: 14, b is an 
integer of 15 to 3740. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 14, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODKD BY GENE NO: 5 

This gene is expressed primarily in human stomach and stomach tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the gastrointestinal system, particularly cancer or ulcers of 
stomach tissue. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 



WO 00/06698 



17 



PCT/LS99/17130 



tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the digestive system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or ceil types (e.g., gastrointestinal, 
or cancerous and wounded tissues) or bodily fluids (e.g., bile, lymph, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in tumors of the stomach indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis, treatment and 
intervention of these tumors, in addition to other tumors where expression has been 
indicated. Additionally, the protein product of this gene may play a role in the normal 
function of the stomach and/or digestive system. Furthermore, the protein may also be 
used to deiermine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO; 15 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 1 82 of SEQ ID NO: 1 5, b is an 
integer of 15 to 11 96, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 15, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

Preferred polypeptides of the invention comprise the following amino acid 
sequences: 
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FEIALPRESNfTVLIKLGTPTLLAKPCYlVISKRHITNlLSIKSGERIVFTFSCQSP^ 

NHFVIEIQKJS4DCMSGPCPFGEVQLQPSTSLLPTLNRTFIWDVKAHKSIGLELQ 

FSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFCSNGTVSRIKM (SEQ ID 

NO: 266), and/or GTRAAPGLGAWGRRSPPSFSPPRPRRPG VMAGLNCGVSIAL 

LGVLLLGAARLPRGAEAFEIALPRESNITVLIKLGTPTXLAKPCYIVISKRHTTM 

LSIKSGERIVFTreCQSPENHFVIEIQKNIDCMSGPCPFGEVQLQPSTSLLPTLNR 

TFIWDVKAHKSIGLELQFSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFC 

SNGTVSR[K^fQRGVKMALHLPWFHPRNVSGFSIANRSSIKRLaIESVFEGEGS 

ATLMSANYPEGFPEDELMTWQFWPAHLRASVSFLNFNL^^ 

fPGSTTNTEVFKLEDKQPGNMAGNFNLSLQGCDQDAQSPGILRLQFQVLVQH 
PQNESNKIYVVDLSNERAMSLTIEPRPVKQSRKFVPGCFVCLESRTCSSKLTLT 
SGSKHKISFLCDDLTRLWMNVEKP (SEQ ID NO: 265). Polynucleotides encoding 
these polypeptides are also provided. 

This gene is expressed primarily in placenta, and to a lesser extent in, prostate 
and ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. male and female infertility, and associated disorders of the 
reproductive system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the reproductive system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., reproductive, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
amniotic fluid, seminal Quid, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

The tissue distribution of this gene in the prostate, placenta and ovary 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for treatment, prevention, and/or diagnosis of male or female infertility, endocrine 
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disorders, fetal deficiencies, ovarian failure, amenorrhea, ovarian cancer, benign 
prostate hyperplasia and prostate cancer. Similarly, the tissue distribution indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of cancer and other proliferative disorders. Expression within 
placental tissue and other cellular sources marked by proliferating cells indicates that 
this protein may play a role in the regulation of cellular division. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Furthermore, the protein 
may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their' 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 16 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2 1 95 of SEQ ID NO: 1 6, b is an 
integer of 15 to 2209, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 16, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

The translation product of this gene shares homology with the human and rat 
HNK-l sulfotransfcrase protein (Sec, e.g., Genbank Accession Nos. gblAAB88123.ll 
(AF022729) and gil292l3061gblAAC04707.il (AF033827); all references available 
through these accessions are hereby incorporated herein by reference.) Ong E, et al. (J 
Biol Chem. 273(9):5 190-5 (1998)) have characterized the human HNK-1 
sulfotransfcrase, and show that it is involved in the synthesis of the HNK-I 
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carbohydrate epitope which is expressed on various adhesion molecules in the 
nervous system and on immune cells (e.g.. natural killer cells) and is suggested to 
play a role in cell-cell and cell-substratum interactions. Based on the sequence 
similarity, the translation product of this gene is expected to share at least some 
biological activities with HNFC- 1 sulfotransferase proteins. Such activities are known 
in the an, some of which are described elsewhere herein, or in, for example, Bakker, 
et al., J Biol Chem. 272:29942-6 ( 1997), incorporated herein by reference. Based on 
sequence similarity between sulfotransferases, a consensus sequence for the active 
site was developed (Ong, et al., supra). The consensus pattern is as follows: 
xxRPDzzzz, where x represents hydrophobic amino acid residues and z represents 
any amino acid residue. Therefore, 

Preferred polypeptides of the invention comprise the following amino acid 
sequences: FVRDPFVRL (SEQ ID NO: 267), FLFVRDPFVRLIS (SEQ ID NO: 
268), FLFVRDPFVRLIS AF (SEQ ID NO: 269), and/or YLHTSFSRPHTGPPLPTPG 
PDRDRELT ADS D VDEFLDKFLS AG V KQS DLPR KETEQPPAPGS MEEN VRG Y 
DWSPRDARRS PDQGRQQAERRS VLRGFC ANS SLAFPTKERAFDDIPNS ELSHL 
IVDDRHGAIYCYVPKVACTNWKRVMIVLSGSLLHRGAPYRDPLRIPREHVH 
NASAHLTWKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPFVRLIS 
FELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDPH 
TEKLAPFNEHWRQVYRLCHPCQ1DYDFVGKLETLDEDAAQLLQLLQVDRQ 
LRFPPSYRNRTASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLL 
RD (SEQ ID NO: 270). Polynucleotides encoding these polypeptides are also 
provided. 

Further preferred are the sulfotransferase active site polypeptides listed above, 
and at least 5, 10, 15, 20, 25 5 30, 50, or 75 additional contiguous amino acid residues 
of the sequence referenced in Table 1 for this gene. The additional contiguous amino 
acid residues is N-terminal or C- terminal to the sulfotransferase active site 
polypeptides. Alternatively, the additional contiguous amino acid residues is both N- 
terminal and C-terminal to the sulfotransferase active site polypeptides, wherein the 
total N- and C-terminal contiguous amino acid residues equal the specified number. 
The above preferred polypeptide domains arc characteristic of a signature specific to 
sulfotransferase proteins. The nucleotides sequence of this gene was found to be 
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homologous to the human hypoxanthine guanine phosphoribosyl transferase 2 cDNA 
which is know to be involved in the purine salvage pathway resulting in the 
maimainance of homeostatic levels of uric acid (See Genbank Accession No.T30127). 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
7. Accordingly, polynucleotides related to this invention arc useful as a marker in 
linkage analysis for chromosome 7. 

This gene is expressed to a very high level in HL-60 myelogenous leukemia 
cell lines, and to a lesser extent, in most cell types of the immune system. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, bur are 
not limited to, immune, myelopoiesis, and metabolic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
hematopoietic systems, expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g., immune, metabolic, or 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 130 as residues: Ser-39 to Gly-46, Leu-49 to Ala-62, 
Lys-79 to AIa-93, Gly-95 lo Asp- 1 05, Ser- 107 to Val- 127, Gly-193 to Leu-200, Lys- 
218 to Ser-227, Lys-234 to Thr-239, Pro-366 to Asp-379, Pro-406 to Asp-414. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in HL-60 myelogenous leukemia cell lines and 
homology to HNK-l sulfotransferase proteins indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis, prevention 
and/or treatment of a variety of immune system disorders, including but not limited 
to ; those involving the HNK- 1 carbohydrate epitope, (e.g. HNK-l as an auto-antigen 
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in peripheral demyelinative neuropathy). Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11,13, 14, 
16, 18, 19, 20. and 27, and elsewhere herein. Briefly, expression of this gene product 
in tonsils indicates a role in regulating the proliferation; survival; differentiation; 
and/or activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g., by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases sudh 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other hlood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and couimitted progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the homology to a conserved purine metabolism protein may 
suggest that polynucleotides and polypeptides corresponding to this gene are useful 
for the diagnosis, prevention, and/or treatment of various metabolic disorders such as 
Tay-Sach's Disease, phenylkenonuria, galactosemia, porphyrias, Hurler's syndrome, 
and various urogenital disorders related to metabolic conditions, particularly Lesch- 
Nyhan syndrome. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, 
to identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 17 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-h, where a is any integer between 1 to 1760 of SEQ ID NO:17, b is an 
integer of 15 to 1774, where both a and b correspond to the positions of nucleotide 

4'' 

residues shown in SEQ ID NO: 17, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

When tested against Jurkat T-cell lines, supernatants removed from cells 
containing this gene activated the gamma activating sequence (GAS) promoter 
element. GAS is a promoter element found upstream of many genes which are 
involved in the Jak-STAT pathway, a large, signal transduction pathway involved in 
the differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. Thus, it is likely that 
this gene activates T-cells through the Jak-STAT signal transduction pathway. 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: KLVRLQVPVRNSRVDPRVRSKIGSRRWMLQLI 
MQLGSVLLTRCPFWGCFSQLMLYAERAEARRKPDIPVPYLYFDMGAAVLCA 
SFMSFGVKRRWFALGAALQLALSTYAAYIGGYVHYGDWLKVRMYSRTVAII 
GGFLVLASGAGELYRRKPRSRSLQSTGQVFLGIYLICVAYSLQHSKEDRLA 
YLNHLPGGELMIQLFFVLYGILALAFLSGYYVTLAAQILAVLLPPVMLLIDG 
NVAYWHNTRRVEFWNQMKLLGESVGIFGTAVILATDG (SEQ ID NO: 271). 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MQLGSVLLTRCPFWGCFSQLMLYAERAEARRKPDIPVP 
YLYFDMGAAVLCASFMSFGVKRRWFALGAALQLAISTYAAYIGGYVHYGD 
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WLKVRMYSRTVAnGGFLVLASGAGELYRRKPRSRSLQSTGQVFLGrYLICVA 
YSLQHSK£DRLAYLNHLPGGELMQUTVLYGILAPGLSVRLLRDPRCPDPGC 
TAAPCHAAH (SEQ ID NO: 272). Polynucleotides encoding these polypeptides are 
also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
17. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 17, 

This gene is expressed primarily in endometrial tumors, and to a lesser extent, 
in T-cells, pituitary and to a certain extent in most cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a * 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, female reproductive, immune, or endocrine disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
and/or immune systems expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g., immune, reproductive, or 
cancerous and wounded tissues) or bodily fluids (e.g., amniotic fluid I lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 13 1 as residues: AIa-27 to Asp-34, Tyr-1 16 to Leu- 
125. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution predominantly in the endometrium indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of a range of immune and/or reproductive 
disorders including endometriosis, endometritis, and endometrioma. Similarly, the 
tissue distribution in T-cells and the ability of supernatants expressing this gene to 
stimulate the GAS promoter element in T-cells indicates polynucleotides and 
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polypeptides corresponding to this gene are useful for the diagnosis and treatment of a 
variety of immune system disorders. Representative uses are described in the 
"Immune Activity" and "infectious disease* sections below, in Example 11,13. 14, 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival: differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g., by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for' 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various eel! 
types. Alternatively, the tissue distribution in pituitary indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of various endocrine disorders and cancers. Representative uses are 
described in the "Biological Activity", "Hypcrproliferative Disorders", and "Binding 
Activity" sections below, in Example 1 1, 17, 18, 19, 20 and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of the 
Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of the pancreas 
(e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 



WO 00/06698 



26 



PCT/US99/17130 



hypopituitarism), thyroid (e.g., hyper-, hypothyroidism), parathyroid (e.g., hypcr- 
,hypoparathyroidism) ; hypothallamus, and testes. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 8 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1660 of SEQ ID NO: 1 8, b is an 
integer of 1 5 to 1 674, where both a and b correspond to the positions of nucleotide * 
residues shown in SEQ ID NO: 1 8, and. where b is greater than or equal to a +- 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of the myeloid leukemia 
cell line AML-193 to calcium. Thus, it is likely that the product of this gene is 
involved in a signal transduction pathway that is initiated when the product binds a 
receptor on the surface of the plasma membrane of myeloid leukemia cells, in 
addition to other cell-lines or tissue cell types. Thus, polynucleotides and polypeptides 
have uses which include, but are not limited lo, activating myeloid leukemia cells. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: SNEILLSFPQNYYIQWLNGSLIHGLWNLASLFS 
NLCLFVLMPFAFF^ESEGbAOLKKGlRARlLETLVMLLLLALLILGrVWVAS 
AJLIDr^AASMESLYDLWEFYLPYLYSCKLMGCIX^ 

HLLVKPTILEDLDEQIYIITLEEEALQRRLNGLSSSVEYNIMELEQELENVKTL 

KTKLERRKKASAWERNLVYPAVMVLLLIETSISVLLVACNrLCLLVDETAM 

PKGTRGPGIGNASLSTFGFVGAALETILFFYI.MVSSVVGFYSLRFFGNFTPKKD 

DTTMTKIIGNCVSILVLSSALPVMSRTLGITRFDLLGDFGRFNWLGNFYIVLS 

YNLLFAIVTTLCLVRFCPTSAVREELFKALGLHKLHLPNTSRDSETAKPSVNGH 
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QKAL (SEQ ID NO: 273). Polynucleotides encoding ihese polypeptides are also 
provided. 

This gene is expressed primarily in fetai heart, and to a lesser extent, in colon 
and the adult pulmonary system. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, heart, lung and digestive disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the cardiovascular, pulmonary * 
and digestive systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., developmental, cardiovascular, 
or cancerous and wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic . 
epitopes shown in SEQ ID NO: 132 as residues: GIu-67 to Asn-74, G(u-88 to Asn-93, 
Lys-95 to AIa-107, Ala-147 to Arg-153, Phe-197 toThr-205, Pro-292 to His-308. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in fetal heart, colon and pulmonary tissues and the 
biological activity in increasing the permeability of the plasma membrane of the 
myeloid leukemia ceil line AML-193 to calcium, likely indicating that the product of 
this gene is involved in a signal transduction pathway that is initiated when the 
product binds a receptor on the surface of the plasma membrane of myeloid leukemia 
cells, indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment, prevention, and/or detection of a range of disorders including 
a variety of vascular disorders and conditions, which include, but are not limited to 
miscrovascular disease, vascular leak syndrome, aneurysm, stroke, embolism, 
myocardial infarction, myocarditis, ischemia, thrombosis, coronary artery disease, 
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arteriosclerosis, and/or atherosclerosis; pulmonary edema and embolism, bronchitis 
and/or cystic fibrosis: Crohn's Disease anoVor colon cancer. Similarly, the tissue 
distribution indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for the diagnosis and treatment of cancer and other proliferative 
disorders. Expression within embryonic tissue and other cellular sources marked by 
proliferating cells indicates that this protein may play a role in the regulation of 
cellular division. Similarly, embryonic deveJopment also involves decisions involving 
cell differentiation and/or apoptosis in pattern formation. Thus this protein may also 
be involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Furthermore v the protein may also be used to determine biological activity, to 
raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 19 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2004 of SEQ ID NO: 19, b is an 
integer of 15 to 2018. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 19, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

The protein product of this clone shares sequence homology with the human 
MaxiK channel beta 2 subunit (See, Genbank Accession No. 
gblAAD23380.1IAF099137_l (AF099137); all references available through this 
accession are hereby incorporated herein by reference), which is believed to be a 
modulatory subunit of the voltage and Ca2+ activated K+ (MaxiK) channel. 
Additionally, this protein shares homology to the human calcium-activated potassium 
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channel beta subunit, which, when combined with its corresponding alpha subunit and 
modulating peptide, are believed to be useful in treating asthma, angina, hypertension, 
incontinence, migraine, irritable bowel syndrome (IBS). The subsequent 
heteromultimer that forms upon combining the alpha, beta, and modulator subunits 
are also thought to be useful in preventing premature labour, preventing and treating 
cerebral ischemia, inducing pain modulation and decreasing neurogenic inflammation 
in a patient (See GeneSeq Accession No. R85306). 

Preferred polypeptides comprise the soluble domain which consists of the 
following amino acid sequence: RS YMQS VWTEESQCTLLN ASITETFNCSFSCGP 
DCWKJ^QYPCLQVYV^ 

SLVNVVMENFRKYQHFSCYSDPEGNQKSVILTKLYSSNVT.FHSLFWPTCMMA 
GGVAIVAMVKLTQYLSLLCERIQRINR (SEQ ID NO: 274). Polynucleotides 
encoding these polypeptides are also provided. Based on the sequence similarity, the 
translauon product of this gene is expected to share at least some biological activities 
with modulatory subunits of voltage and Ca2+ activated K+ channel proteins. Such 
activities are known in the art, some of which are described in Wallner. et al., PNAS 
96:4 137-4 142 (1999), incorporated herein by reference! 

This gene is expressed primarily in adrenal gland tumor, and to a lesser extent, 
in Hodgkin's lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, endocrine and immune disorders, particularly Hodgkin's Lymphoma. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the ussuc(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and/or endocrine systems, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., immune, 
endocrine, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, scrum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
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level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 133 as residues: Trp-25 to Gln-30, Pro-50 to Gln-57, 
Pro-93 to Glu-iOU Arg-l 14 to Cys-121, Scr-123 to Gln-129, Ile-177 to Arg-182. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in adrenal gland tumor and it's identification as the 
modulatory subunit of the voltage and Ca2+ activated K+ (MaxiK) channel indicates 
that polynucleotides and polypeptides corresponding to this gene arc useful for the 
detection, treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of 
the pancreas (e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g., hyper-, hypothyroidism), parathyroid (e.g., hyper- 
,hypoparathyroidism), hypothalamus, and testes. Alternatively, expression in 
proliferative immune tissues combined with its homology to a novel human K 
channel indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and treatment of a variety of immune system disorders. 
Representative uses are described in the "Immune Activity" and "infectious disease" 
sections below, in Example 1 1, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. 
Briefly, the expression of this gene product in Hodgkin's lymphoma indicates a role in 
regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product is involved 
in the regulation of cytokine production, antigen presentation, or other processes 
suggesting a usefulness in the treatment of cancer (e.g., by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versus -graft and graft- versus-host diseases, or autoimmunity 
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disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythernatosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Furthermore, the protein may also be used to determine biological activity, 
raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor murker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:20 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2084 of SEQ ID NO:20, b is an 
integer of 15 to 2098, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:20, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

The translation product of this gene shares homology with collagen and 
collagen like proteins (See, e.g., Genbank Accession Nos. 
gil2920535lgblAAC39658.II (AF018081) and gi!2384942lgbl AAB6996 1 . 1! 
( AF022985); all references available through these accession numbers are hereby 
incorporated by reference herein). Additionally, it has been determined that this gene 
has homology to the human Kruppel related zinc finger protein (HTF10) which is 
known to be important as a transcription factor, particularly in development (See 
Genebank Accession No.Ll 1672). 
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In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: AFAHLQLGPMWKLWRAEEGAAALGGALFLLL 
FA LGVRQLLKQRRPMGFPPGPPGLPFIGNI YSLA ASSELPH V YMRKQSQV YG 
EVQPRRAPGREGRQAGPGWPGPSWLDLWPPLGRLVGTSPCAGCPLRDTRFPG 
LEGRS PRRRAPLQGEPRPCR (SEQ ID NO: 275). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in human erythroleukemia ceil line ( HFJ ) 
serum induced smooth muscle, and to a lesser extent in human 8 week whole embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, leukemia, musculoskeletal, or developmental disorders. Similarly, 
polypeptides and antibodies directed tq these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hematopoietic 
system and muscular system, expression of this gene at significantly higher or lower 
levels is routinely detected in certain tissues or cell types (e.g., immune, 
musculoskeletal, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
amniotic fluid, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO; 134 as residues: Leu-30 to Gly-38, Arg-67 to Val-72, 
Val-76 to Ala-89, Pro-118 to Ajg-123, Gly-129 to Ala-136, Leu-138 to Arg-146. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in human erythroleukemia cell line (HEL), and serum 
induced smooth muscle, and the shared homology with collagen and collagen like 
proteins indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for disorders of hematopoietic or muscular systems, such as leukemia and 
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muscular dystrophy. Additionally, the shared homology with collagen proteins would 

suggest that this protein may also be important in the diagnosis or treatment of 
10 various autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 

dermatomyositis, etc.), dwarfism, spinal deformation, joint abnormalities, and 
5 chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 

osteoarthritis, Atelosteogcnesis type II, metaphyseal chondrodysplasia type Schmid, 
15 etc.). 

The secreted protein can also he used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in * 
the specification. Typical of these are cytokine, cell proliferauoriAiiffercntiauon 
modulating activity or induction of other cytokines; 
25 »mmunostimulating/immunosuppressant activities (e.g., for treating human 

15 immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g., for treating 
20 infections, tumours); haemostatic or thrombolytic activity (e.g., for treating 
35 haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g., for treating 

septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferativedisea.se; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, 
to identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/ur immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
50 related to SEQ ID NO:2 1 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1732 of SEQ ID NO:21, b is an 
integer of 15 to 1746, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:21, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEJN ENCODED BY GENE NO: 12 
20 10 A preferred polypeptide fragment of the invention comprises the following * 

amino acid sequence: MRVRIGLTLLLCAVLLSLASASSDEEGSQD 
ESLGFQDYFDIR (SEQ ID NO: 276). Polynucleotides encoding these polypeptides 
are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
8. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 8. 
30 This S ene is expressed primarily in dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
25 significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
45 taken fro,n an indi v »<hal having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
30 individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 135 as residues: Ser-22 to Ser-4K GIu-43 toThr-50, 
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Ser-63 to Leu-68 r Ser-7 1 to Gly-84, Ser-96 to Gly-l 14. Polynucleotides encoding said 
polypeptides are also provided. 

The tissue distribution in dendritic cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis, prevention, 
and/or treatment of a variety of immune system disorders. Representative uses are 
described in the "Immune Activity" and "infectious disease" sections below, in 
Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the * 
treatment of cancer (e.g., by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and comiiutted progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these arc cytokine, cell proliferation/differentiation 
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modulating activity or induction of other cytokines; 
immunostimuiating/immunosuppressant activities (e.g. ; for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g., for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g., for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g., for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 22 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer, between 1 to 2862 of SEQ ID NO:22, b is an 
integer of 15 to 2876, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ TO NO:22, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

A preferred polypeptide variant of the invention comprises the following 
amino acid sequence: MARGSLRRLLRLLVLGLWLALLRSVAGEQAPGTAPC 
SRGSSWSADLDKCMDCSTSCPLPA ALAHPWGRSEPDLRAGAAFWLFGLE 
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TMPQE REVHHPHRGDRRRGLPSCGADPVTMCPLPAGARPLIIHSSILEPVSAS 
QTRREPSSSNHK GGGGR (SEQ ID NO: 277). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
16. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 16. 

This gene is expressed primarily in tumor growth factor or lipopolysaccharide 
treated bone marrow stroma, epithelioid sarcoma, umbilical vein endothelial cells, and 
to a lesser extent, in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a • 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoiesis or immune disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the hematopoietic, 
integumentary, or immune systems expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., hematopoietic, 
immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO; 136 as residues; Pro-35 to Trp-42, Pro-65 to Asp-72, 
Thr-86 to Phe-93, Ile-97 to Glu-103. Polynucleotides encoding said polypeptides are 
also provided. 

The tissue distribution in tumor growth factor or lipopolysaccharide treated 
bone marrow stroma, epithelioid sarcoma, and umbilical vein endothelial cells 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for the diagnosis, prevention, and/or treatment of a variety of immune system 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
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disease" sections below, in Example 1 1, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g., by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities^ ■ 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these are cytokine, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 
immunostimuJating/immunosuppressant activities (e.g., for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
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20 



control of fertility); chemocactic and chemokirtetic activities (e.g., for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g., for treating 
10 haemophilia, cardiac infarction etc.); ami-mflamrnatory activity (e.g., for treating 

septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
> hyperproliferati ve disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:23 and may have been publicly available prior to conception' of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
25 cumbersome. Accordingly, preferably excluded from the present invention are one or 

1 5 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1038 of SEQ ID NO:23, b is an 
integer of 15 to 1052, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:23, and where b is greater than or equal to a + 14. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 14 
35 Tne translation product of this gene shares sequence homology with 

, chromaffin granule amine transporter protein which is thought to be important in 
vesicle membrane amine transport, particularly in the neural and endocrine tissue, and 
the human vesicular monoamine transporter hVMATl which is involved in the 
regulation of amine storage in cardiovascular, endocrine, and central nervous system 
function (See, Genbank Accession Nos. giil314290 and gbl A AC50472. 1 1; all 
references available through these accession numbers are hereby incorporated by 
reference herein). Based on these sequence similarities, The translation product of 
this gene is expected to share at least some biological activities with amine transporter 
30 proteins. Such activities are known in the art, some of which are described in 
Erickson, et al., PNAS 93:5166-5171 (1996), and/or Lkuefal., Cell 70:539-551 
50 (1992), which are both incorporated herein by reference. 
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In a specific embodiment, polypeptides comprising the amiao acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GTSFLDPTLSLFVLEKFNLPAGYVGLVFLGMAL 
SYAISSPLFGLLSDICRPPLRKvVLLVFGNLrrAGCYMLLGPVPILHIKSQLWLL 
VLILVVSGI^AGMSIIPTFPEILSCAHENGFEEGLSTIXjLVSGLFSAMWSIGAF 
MGPTLCXjFLYEKIGFEWAAAIQGLWALK^ 

STEEERTTLLPNET (SEQ ID NO: 278). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
6. Accordingly, polynucleotides related to this invention are useful as a marker in * 
linkage analysis for chromosome 6. 

This gene is expressed primarily in colon cancer, osteoclastoma, and T-cell 
lymphoma, and to a lesser extent in many tumor or proliferative tissues such as 
endometrial tumor, chondrosarcoma, induced umbilical vein endothelial cells. 

Thercfore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to. diseases resulting from disorders in small molecule transport (i.e., 
signalling molecules) in afflicted tissues and organs, particularly of the endocrine and 
central nervous systems. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the musculoskeletal, immune, and/or digestive systems 
and cancer expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., neural, endocrine, or cancerous and 
wounded tissues) or bodily fluids (e.g., bile, lymph, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 137 as residues: Ser-I 14 to Asn-123, Thr-127 to Thr- 
132. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in colon cancer, osteoclastoma, and T-cell lymphoma 
and homology to amine transporter family members indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
disorders or diseases resulted from small molecule transport in afflicted tissues and 
organs, particularly that of colon, osteoclast or T-cells. The expression in cancer 
tissues, and shared homology with transporter proteins may also indicate its role in 
anti-cancer drug resistance. Additionaly, the protein can be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands, fo 
identify agents that modulate their interactions and as nutritional supplements. It may 
also have a very wide range of biological activities although no evidence for any is 
provided in the specification. Typical of these are cytokine, ceil 
proliferation/differentiation modulating activity or induction of other cytokines; 
immunostimulating/immunosupprcssant activities (e.g. for treating human 
immunodeficiency virus infection', cancer, autoimmune diseases and allergy); 
regulation of haematopoicsis (e.g. for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g. for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g. for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperprolifcrative disease; or for identifying inhibitors or promoters of the transport 
of toxic molecules to vesicles, for regulation of metabolism, behaviour, and many 
others. Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:24 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 527 of SEQ ID NO:24, b is an 
integer of 1 5 to 1 541, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:24, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

The translation product of this gene shares sequence homology with the 
human prolyl 4-hydroxyIase alpha (II) subunit which is important in catalyzing the ' 
formation of 4-hydroxyproline in collagens which is essential for the folding of newly 
synthesised collagen polypeptide chains into triple-helical molecules (See Genbank 
Accession No. gblAAB7l339.ll; all references available through this accession are 
hereby incorporated herein by reference). Based on the sequence similarity, the 
translation product of this gene is expected to share at least some biological activities 
with Prolyl 4-hydroxylase proteins. Such activities are known in the art, some of 
which are described in Annunen, etal., J. Biol. Chem. 272:17342-17348 (1997) 
which is incorporated herein by reference. 

When tested against U937 myeloid and Jurkat T-cell cell lines, supernatants 
removed from cells containing this gene activated the gamma activating sequence 
(GAS), a promoter element found upstream of many genes which are involved in the 
Jak-STAT pathway. The Jak-STAT pathway is a large, signal transduction pathway 
involved in the differentiation and proliferation of cells. Therefore, activation of the 
Jak-STAT pathway, reflected by the binding of the GAS element, can be used to 
indicate proteins involved in the proliferation and differentiation of cells. Thus, it is 
likely that this gene activates myeloid cells and T-cells through the Jak-STAT signal 
transduction pathway. 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: 
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GTREARIJUDLTRFYDKVLSLHEDSTTPVANPLLA 

EASENIRALKDGYEKVEQDLPAFEDLEGAARALMRLQDVYMLNVKGLAR 

GVFQRVTGSAITDLYSPKRLFSLTGDDCFQVGKVAYDMGDYYHAIPWLEEA 

VSLFRGSYGEWKTEDEASLEDALDHLAPAYFRAGNVSCALSLSREFLLYSPD 

NKRMARNVLKYERLLAESPNHVVAEAVrQRPNIPHLQTRDTYEGLCQTL 

GSQPTLYQ1PSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQ 

KIRELAEPWLQRSVVASGEKQLQVEYRISFCSAWLKDTVDLKLVTLNHRIAA 

LTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFM 

IYI^SVEAGGATAFIYAJST1^\0>VVRNAALFWWNLTO 

VLVGDKWVANKWIHEYGQEFRRPCSSSPED (SEQ ID NO: 282). Additional, 

Preferred polypeptides comprise the following amino acid sequence: GTREA 
RLRDLTRFYDKVLSLHEDSTTPVANPLLAFTLIKRLQSDWRNVVHSLEASENI 
RALKDGYEKVEQ DLPAFEDLEGA ARAL (SEQ ID NO: 279), ALMRLQD (SEQ 
ID NO: 280), and/or VEAGGAT (SEQ ID NO: 28 1 ). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in lymph node breast cancer, colon 
carcinoma, and to a lesser extent in osteoblasts and adipocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of connective and immune tissues, particularly autoimmune 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the connective tissues in breast, colon, bone, and fat, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., immune, connective, or cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 138 as residues: Ser-74 to Ala-84, Gin- 156 to tyr- 
161, Tyr-184 to Asn-189, Ser-218 to fle-223, Pro-299 to Ser-308, His-359 to Thr-368, 
Tyr-390 to Asp-404. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in lymph node breast cancer and colon carcinoma and 
homology to prolyl 4-hydroxylase alpha (II) subunit indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for intervention of connective 
tissue disorders and diseases (e.g. arthritis, trauma, tendonitis, chrondomalacia and 
inflammation), as well as, in the diagnosis or treatment of various autoimmune 
disorders such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as 
well as dwarfism, spinal deformation, andspecific joint abnormalities as well as 
chondrodysplasias ie, spondyloepiphyseal dysplasia congenita, familial osteoarthritis, 
Atelosteogenesis type n, metaphyseal chondrodysplasia type Schmid. Alternatively, 
the tissue distribution within various tissue carcinomas and tumor tissues, and 
biological activity reflected by the binding and activation of the GAS promoter 
element indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and treatment of cancer and other proliferative disorders. 

Expression in cellular sources marked by proliferating cells indicates that this 
protein may play a role in the regulation of cellular division. Similarly, embryonic 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:25 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2065 of SEQ ID NO: 25, b is an 
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integer of 15 to 2079, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:25, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

In an additional embodiment, polypeptides comprising the amino acid 
sequence of the open reading frame upstream of the predicted signal peptide are 
contemplated by the present invention. Specifically, polypeptides of the invention 
comprise the following amino acid sequence: IQPSHAALLHCRSTFRKTECLDPW 
WVRRQLLGMAGIGGLQKMKAPHTGVLm^ 
SGCMSTVRWLNSNITANRTLSRSVCH 

AGEAGSWGY (SEQ ID NO: 283). Polynucleotides encoding these polypeptides are 
also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
20. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 20. 

This gene is expressed primarily in fetal liver, and, to a lesser extent, in a 
variety of fetal and other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, liver disorders and cancers (e.g., hepatoblastoma, hepatitis, liver 
metabolic diseases and conditions that are attributable to the differentiation of 
hepatocyte progenitor eel Is). Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver, expression of this gene at significantly higher 
or lower levels is routinely detected in certain ussues or ceil types (e.g., hepatic, or 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, bile, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, nelarive to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 139 as residues: Ser-67 to Tyr-75. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in fetal liver indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for detection and treatment of liver 
disorders and cancers (e.g., hepatoblastoma, jaundice, hepatitis, liver metabolic 
diseases and conditions that are attributable to the differentiation of hepatocyte 
progenitor cells). In addition the expression in fetus would suggest a useful role for 
the protein product in developmental abnormalities, fetal deficiencies, pre-natal 
disorders and various would-healing models and/or tissue trauma. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:26 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between I to 1933 of SEQ ID NO:26, b is an 
integer of 15 to 1947, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:26, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

The translation product of this gene shares sequence homology with human 
laminin Bl which is thought to be an important structural extracellular matrix 
component involved in cell migration and signalling, paricularly in stimulating 
epithelial cell growth and differentiation (See, Gcnbank Accession No gill 86837). 
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Preferred polypeptides of the invention comprise the following amino acid 
sequences: CSSPKJRLPWCWTAPRTLGKHGSLISTLRLTAPLHLAWKMMLS. 
RKALFVLLNTPVLFHALEGRLFSKLCHHHTIQRTLTVPKFRSS (SEQ ID NO: 
284), RSPTSRVQLLKRQSCPCQRNDLNEEPQHFTHYAIYDFIVKGSCFCNG 
HADQCIPVHGFRPVKAPGTFHMVHGKCM {SEQ ID NO: 285), and/or HNTAG 
SHCQHCAPLYNDRPWEAADGKTGAPNECRTCKCNGHADTCHFDVNVWEAS 
GNRSGGVCDDCQHNTEGQYCQRCKPGFYRDLRRPFSAPDACKPCSCHPV 
GSAVLPANSVTFCDPSNGDCPCKPGVAGRRCDRCMVGYWGFGDYGCRP 
CDCAGSCDPITGDQSSHTDIDWYHEVPDFRPVHNKSEPAWEWEDAQGFSAL 
LHSGKCECKEQTLGNAKAFCGMKYSYVLKJ^ 

KVLKSTKLKJFRG KA NI IS RIM DGQ RM HLS N PQS W FG I PCS RT (SEQ ID NO: 
286). Polynucleotides encoding these polypeptides are also provided. 

Included in this invention as preferred domains are Laminin-type EGF-like 
(LE) domain signatures, which were identified using the ProSite analysis tool (Swiss 
Institute of Bioinformatics). Laminins are the major noncollagenous components of 
basement membranes that mediate cell adhesion, growth migration, and 
differentiation. They are composed of distinct but related alpha, beta and gamma 
chains. The three chains form a cross-shaped molecule that consist of a long arm and 
three short globular arms. The long arm consists of a coiled coit structure contributed 
by all three chains and cross-linked by interchain disulfide bonds. Beside different 
types of globular domains each subunit contains, in its first half, consecutive repeats 
of about 60 amino acids in length that include eight conserved cysteines. The tertiary 
structure of this domain is remotely similar in its N-terminal to that of the EGF-like 
module. It is known as a 'LE T or 'laminin-type EGF-like' domain. The number of 
copies of the LE domain in the different foims of laminins is highly variable; from 3 
up to 22 copies have been found. A schematic representation of the topology of the 
four disulfide bonds in the LE domain is shown below. 
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'C: conserved cysteine involved in a disulfide bond 
'a': conserved aromatic residue 

' G': conserved glycine (lower case = less conserved) 
' s*: region similar to the EGF-like domain 
* * ' : position of the pattern 

In mouse laminin gamma- 1 chain, the seventh LE domain has been shown to 
be the only one that binds with a high affinity to nidogcn. The binding-sites arc 
located on the surface within the loops C1-C3 and C5-C6. Long consecutive arrays of 
LE domains in laminins form rod-like elements of limited flexibility, which determine 
the spacing in the formation of laminin networks of basement membranes. We 
derived a signature pattern for the LE domain which covers the C-terminal half of the 
repeat starting with the fourth conserved cysteine. The consensus pattern is as 
follows: C-A(l,2)-C-x(5>-G-x(2)-C-x(2)-C-x(3 t 4)-!FYWl-x(3,l5)-C (All Cs are 
involved in disulfide bonds) 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: CDDCQHNTEC QY CQRCKPG FY RDLRRPFS APD ACKPC (SEQ ID 
NO: 287) and/or CPCKPGVAGRRCDRCMVGYWGFGDYGCRPCDCAGSC (SEQ 
ID NO: 288). Polynucleotides encoding these polypeptides are also provided- 
Further preferred are polypeptides comprising the lamin in-type EGF-like 
domains listed above, and at least 5, 10, 15, 20, 25, 30, 50, or 75 additional 
contiguous amino acid residues of the sequence encoded by this gene. The additional 
contiguous amino acid residues is N-terminal or C- terminal to the laminin-type EGF- 
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like domain. Alternatively, the additional contiguous amino acid residues is both N- 
terminal and C-termiaal to the laminin-type EGF-like domain, wherein the total N- 
and C-terminal contiguous amino acid residues equal the specified number. The 
above preferred polypeptide domain is characteristic of a signature specific to 
Laminin proteins. Based on the sequence similarity, the translation product of this 
gene is expected to share at least some biological activities with Laminin proteins. 
Such activities are known in the art, some of which are described elsewhere herein. 

This gene is expressed primarily in osteoblastic tissues and cell types, 
including osteoblasts, osteoblastomas and osteoclastomas. Expression is also 
abundant in vascular-pulmonary tissues such as lung, micro-vasculature, pulmonary, 
etidoithelial and smooth muscle cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and malignancies (particularly of osteoblastic tissues and 
rhabdomyosarcoma), as well as cardiovascular and respiratory or pulmonary disorders 
such as athsma, pulmonary edema, pneumonia, atherosclerosis, restenosis, stoke, 
thrombosis hypertension, inflammaiion and wound healing. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell rype(s). For a number of 
disorders of the above tissues or cells, particularly of the cardiorespiratory system, 
and skeletal system expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., skeletal, osteoblast, cardio- 
respiratory, vascular, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
pulmonary surfactant, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or ceil sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SFQ ID NO: 140 as residues: Ser-28 to Cys-34, Thro I to Thr-58, 
Tyr-64 to Asn-8 1, Asp-1 1 1 to Lys-1 16, Asp-145 to Phe-160, Pro-203 to Glu-217. 
Polynucleotides encoding said polypeptides are also provided. 
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The tissue distribution in osteoblastic tissues and cell types and homology to 
laminin indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment, prevention and diagnosis of cardiovasular and respiratory or 
pulmonary disorders such as asthma, pulmonary edema, pneumonia, atherosclerosis, 
restenosis, stoke, angina, thrombosis hypertension, inflammation and wound healing. 
As a homolog of laminin, this gene product quite possibly has a role in cell adhesion, 
migration, proliferation, angiogenesis, chondrogenesis, wound healing and 
oncogenesis. An EST (Int J Cancer 1996 May 16;66(4):571-577) with an identical 
sequence to part of this contig was shown to be differentially expressed in human 
primary myoblasts and embryonal rhabdomyosarcoma and therefore might have an 
important role in the determination or maintenance of the normal phenotype, and thus 
its loss is possibly involved in the progression of malignancies, particularly of skeletal 
muscle. Similarly, the homology to a laminin would suggest a role in the detection 
and treatment of disorders and conditions afflicting connective tissues (e.g. arthritis, 
trauma, tendonitis, chrondomalacia and inflammation) in the diagnosis or treatment of 
various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dcrmatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias i.e. spondyloepiphyseal dysplasia 
congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:27 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between I to 3365 of SEQ ID NO:27, b is an 
integer of 15 to 3379, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:27, and where b is greater than or equal to a f 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the ' 
following amino acid sequence: NISSQYCILKSLEMMISGLKLLVLFLKFAPENY 
CLSTETLQMPNRHLRLS KATC YLM KCLLPS YFE (SEQ ID NO: 289). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in placenta, brain, and to a lesser extent, in a 
variety of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive, behavioral, or nervous system disorders, such as: 
depression, schizophrenia, Alzheimer's Disease, Parkinson's Disease, Huntington's 
Disease, dementia, paranoia, addictive behavior, epilepsy, transmissible spongiform 
encephalopathy (TSE), Creutzfeldt-Jakob disease (CJD). Other diseases and 
conditions related to expression in the placenta might include developmental 
anomalies and fetal deficiencies, ovarian and endometrial cancers, reproductive 
disfunction and pre-natal disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous and reproductive systems, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., neural, reproductive, or cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial 
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fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 141 as residues: Ala-16 to Leu-22. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in brain indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 11, 15. and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tpurette Syndrome, 
meningitis, encephalitis, demyelinaung diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, expression in placenta would suggest a 
possible role in the treatment and diagnosis of developmental anomalies and fetal 
deficiencies, ovarian and endometrial cancers, reproductive disfunction and pre-natal 
disorders. 

Similarly, expression within embryonic tissue and other cellular sources 
marked by proliferating cells indicates that this protein may play a role in the 
regulation of cellular division. Similarly, embryonic development also involves 
decisions involving cell differentiation and/or apoptosis in pattern formation. Thus/ 
this protein may also be involved in apoptosis or tissue differentiation and could again 
be useful in cancer therapy. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
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show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:28 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1992 of SEQ ID NO:28, b is an 
integer of 15 to 2006, where both a and b correspond to the positions of nucleotide* 
residues shown in SEQ ID NO:28, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

The translation product of this gene shares sequence homology with the 
murine transforming protein (See, e.g., Genbank Accession No. 
gil53529lcmblCAA36859. II; all references available tlirough this accession are hereby 
incorporated by reference herein}. 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PIEGTPAGTGPEFPGRPTRPQRMRSLISSHPCQ 
HIXLLLIJLLFLILAtLVDVKWYLVmCISLMTSDVEHLFMCLLMRlSSWR 
NVY (SEQ ID NO: 290). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in activated and basal T-celLs. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immunodeficiency, tumor necrosis, infection, lymphomas, auto- 
immunities, cancer, metastasis, wound healing, inflammation, anemias (leukemia) and 
other hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
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polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in activated T-cells and the homology to a murine 
transforming protein indicates polynucleotides and polypeptides corresponding to Ihis 
gene are useful for the diagnosis and treatment of a variety of immune system 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 11,13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g., by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou s Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-grati and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyeli nation, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
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various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate igands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these are cytokine, cell proliferation/differentiation 
modulating activity or induction of other cytokines; 
inimunostimulating/immunosuppressant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g. for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotaclic and chemokinetic activities (e.g. for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g. for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g. for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate 1 igands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:29 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specificallv 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between .1 to 3056 of SOQ ID NO:29, b is an 
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integer of 15 to 3070, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:29, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

The translation product of this gene was shown to have homology to the Mus 
muscuius ALG-2 protein, which is known to code for a Ca(2+)-binding protein 
required for T cell receptor-, Fas-, and glucocorticoid-induced cell death. ALG-2 
mediate Ca(2+)-regulated signals along the death pathway and may play a role in the 
onset of Alzheimer's Disease (See e.g., Genbank Accession No.gil 1 213520; all 
references available through this accession are hereby incorporated by reference 
herein). 

Preferred polypeptides comprise the following amino acid sequence: NWVPT 
CLCPS APCS FHLLS RFKCLFSPQRLTDIFRR YDTDQDGW1Q VS YEQ YLSMVFS 
IV (SEQ ID NO: 29 J), and/or QRLTDIFRRYDTDQDG WIQVSYEQYLSMVFSIV 
(SEQ ID NO: 292). Polynucleotides encoding these polypeptides are also provided. 

When tested against K562 cell lines, supernatants removed from cells 
containing this gene activated the ISRE (interferon -sensitive responsive element). 
Thus, it is likely that this gene activates immune or leukemia cells through the Jaks- 
STAT signal transduction pathway. ISRE is a promoter element found upstream in 
many genes which are involved in the Jaks-STAT pathway. The Jaks-STAT pathway 
is a large, signal transduction pathway involved in the differentiation and proliferation 
of cells. Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the ISRE element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MFYKLTLILCELSVAGVTQAASQRPLQRLPRH1CSQR 
XPPGRCLLKAXLQTTWXXPD KPT PRLSPPLXSDPICR (SEQ ID NO: 293). 
Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
5. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 5. 



WO 00/06698 



57 



PCT/US99/I7I30 



This gene is expressed primarily in placenta, and to a lesser extent, in a variety 
of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and. for diagnosis of diseases and conditions which include, but are 
not limited to, developmental anomalies, fetal deficiencies ovarian and endometrial 
cancers, reproductive dysfunction and pre-natal disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the reproductive system, 
expression of this gene at significantly higher or lower. levels is routinely detected in ' 
certain tissues or cell types (e.g., reproductive, developmental, or cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 143 as residues: Arg-24 to Arg-3 1 , Ile-33 to Gly-4 1 . 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in placenta indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment, prevention 
and/or diagnosis of developmental anomalies, fetal deficiencies, ovarian and 
endometrial cancers, reproductive dysfunction and pre-natal disorders. Expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
combined with the observed ISRE activity, and homology to the apoptosis linked. 
ALG-2 indicates that this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis, treatment, and/or prevention of developmental 
diseases and disorders, cancer, and other proliferative conditions. Representative uses 
are described in the "Hyperproliferative Disorders'* and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. . 
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Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential ro.es in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of - 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplemenL Protein, as well as, 
antibodies directed against the protein may show utility as a tumormarker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:30 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2213 of SEQ ID NO:30, bis an 
integer of 15 to 2227, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:30, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

The translation product of this gene was shown to have homology to the 
human histo-blood group A transferase (See, e.g., Gen bank Accession No. 
gblAAD26573.1IAF1344!3_l (AFI34413); all references available through this 
accession are hereby incorporated by reference herein) which is known to represent 
one of the major allogeneic antigens in both erythrocytes and tissues of humans. Its 
been proposed that the A phenotype is associated with the glycosyltransferas that 
converts the H substance associated with the O phenotype to A through the addition 
of alphal-3-N-acetylgalactosamine oraIphal-3-galactosyl residues to the H antigen 
Fuc-alphal -2Gal - beta 1 -R, Therefore, the primary product of the histo-blood group A 
is its respective glycosyltransferase. Preferred polypeptides of the invention comprise 
the following amino acid sequence: TSSPV FSFCS M A V REPDHLQ 

RVSLPRYNVSASLQWLPCHRIVLQPWHMCAMWELGQVLFHPVAPREGAAPS 
PVSTLTWPSSCS HSESTM ELELQF (SEQ ID NO: 294), LPCHRIV (SEQ ID NO: 
296), SLQWLPCHRIVLQPW (SEQ ID NO: 297), and/or MA VREPDHLQRVSLPR 
(SEQ ID NO: 295). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in 12-week-oId human embryo. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental anomalies, fetal deficiencies, pre-natal disorders, 
hematopoietic disorders, or cancer. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the developing fetus, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., hematopoietic, lymph, developing, or cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 
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The tissue distribution in 12 week old embryo indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the treatment and diagnosis 
of developmental anomalies, fetal deficiencies, pre-natal disorders and cancer. 
Similarly, expression within embryonic tissue and other cellular sources marked by 
proliferating cells indicates that this protein may play a role in the regulation of 
cellular division. Similarly, embryonic development also involves decisions involving 
cell differentiation and/or apoptosis in pattern formation. Thus, this protein may also 
be involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Alternatively, the tissue distribution and homology to human blood group A 
and B glycosyltransferase enzymes indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of hematopoietic 
related disorders such as anemia, pancytopenia, leukopenia, thrombocytopenia or 
leukemia since stroma] cells are important in the production of cells of hematopoietic 
lineages. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the uses include bone marrow cell ex-vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. 

The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
stem cells and committed progenitors of various blood lineages, and in the 
differentiation and/or proliferation of various cell types. Furthermore, the protein may 
also be used to determine biological activity, to raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 3 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
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excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1274ofSEQ IDNO:3Lbisan 
integer of 15 to 1288. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:3I, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

The translation product of this gene shares sequence homology with CD97 
(EMR1), which is thought to be important in both adhesion and signaling processes . 
early after leukocyte activation (See, e.g., Genbank Accession No. gil784994; all 
references available through this accession are hereby incorporated by reference 
herein). EMRl belongs to a novel family of G-protein receptors that has recently been 
recognized on the basis of homologous primary amino acid sequences, comprises 
receptors to hormones of the secretin/vasoactive intestinal peptide/glucagon family, * 
parathyroid hormone and parathyroid hormone -related peptides, growth hormone 
releasing factor, corticotropin-releasing factor, and calcitonin. Proteins with seven 
transmembrane segments (7TM) define a superfamily of receptors (TTMreceptors) 
sharing the same topology: an extracellular N-terminus, three extra membranous loops 
on either side of the plasma membrane, and a cytoplasmic C-terminal tail. Upon 
ligand binding, cytoplasmic portions of the activated receptor interact with 
heterotrimeric G-coupIed proteins to induce various second messengers, which 
subsequently activate various signal transduction pathways depending upon the 
specific G-coupled protein associated with the receptor. Preferred polypeptides of the 
invention comprise the following amino acid sequence: CFKRKPKREHCSCP 
ITYQSIXJDILNASrTSKRKGMQEVKLNSYVVSGTIGLKEKISLSEPVFLTFRHN 
QPGD1CRTKHICVYWEGSEGGRWSTEGCSHVHSNGSYTKCKCFHLSSFAVLV 
ALAPKEDPVLTVITQVGLTlSLLCLFLAILTFLLCRPIQNTSTSLHLELSLCLnA 
HLLr^TGINRTEPEVLCSHAGLLHrl.YLACrTWMLLEGLHLFLTVRNLKVAN 
YTSTGRHCKRFMYPVGYGIPAVIlAVSAIVGPQNYGTFTHCWLKLDKGnWSF 
MGPVAVIILINLVr^FQVLWrLRSKLSSLNKEVSTlQDTRVXn-FKAlSQLnLGC 
SWGLGFFMVEEVGKTIGS1IAYSFTIINTLQGVLLFVVHCLLNRQVRMEYKKW 
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FSGMRKGVETESTEMSRSTTQTKTEEVGKSSEIBIKGGTASSSAESTKQPQPQ 
VHLVSAA WLKMN (SEQ ID NO: 298), and/or FFWKENLRRNGSREDFARRATQ 
LIQSVELSIWNASFASPGKGQISEFDIVYETKRCNETRENAFLEAGNNTMD1NC 
ADALKGNLRESTAV ALSLINLLG1F SEQ ID NO; 299. Polynucleotides 
encoding these polypeptides are also provided. 

Included in this invention as preferred domains are two EGF-Iike protein 
domains, which were identified using the ProSite analysis tool (Swiss Institute of 
Bioinformatics). First, a sequence of about forty ami no-acid residues long found in 
the sequence of epidertjaaJ growth factor (EGF) has been shown to be present in a 
large number of membrane-bound and extracellular, mostly animal proteins. Many of 
these proteins require calcium for their biological function and a calcium-binding site 
has been found to be located at the N-terminus of some EGF-Iike domains. Calcium- 
binding is crucial for numerous protein -protein interactions. We have used the N- 
terminal part of the EGF domain as a consensus pattern. It includes the negative N- 
terminus and the possible hydroxylation site. The consensus pattern is as 
follows: fDEQN|.[DEQN]{2}C{3,l4>C{3,7>CrDNJ.{4}fFY].C [The four Cs are 
involved in disulfide bonds |. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: DINECErGLAKCKYKAYCRNKVGGYIC (SEQ ID NO: 300). 
Polynucleotides encoding these polypeptides are also provided. Secondly, post- 
translation^ hydroxylation of aspartic acid or asparagine to form erythro-beta- 
hydroxyaspartic acid or erythro-beta-hydroxyasparagine has been identified in a 
number of proteins with domains homologous to (EGF). Based on sequence 
comparisons of the EGF-homology region that contains hydroxylatcd Asp or Asn, a 
consensus sequence located in the N-terminal of EGF-Iike domains has been 
identified that seems to be required by the hydroxylase(s). The consensus sequence is 
as follows: C.fDN].{4}[FY|.C.C 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: CRNKVCGYICSC (SEQ ID NO: 301). Polynucleotides encoding these 
polypeptides are also provided. 

Further preferred are polypeptides comprising the calcium-binding EGF-Iike 
domain and aspartic acid and asparagine hydroxylation site listed above, and at least 
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5, 10, 15, 20, 25, 30, 50. or 75 additional contiguous amino acid residues of the 
sequence referenced in Table r for this gene and the embodiments listed herein. The 
additional contiguous amino acid residues is N-terminal or C- terminal to one or both 
of the listed domains. Alternatively, the additional contiguous amino acid residues is 
both N-terminai and C- terminal to one or both of the listed domains, wherein the total 
N- and C-terminal contiguous amino acid residues equal the specified number. The 
above preferred polypeptide domains are characteristic of a signature specific to EGF 
like proteins. Based on the sequence similarity and conserved domains, The 
translation product of this gene is expected to share at least some biological activities 
with EGF-like proteins. Such activities are known in the art, some of which are 
described elsewhere herein. 

This gene is expressed primarily in eosinophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoietic disorders or anemias and leukemias, 
immunodeficiencies, infection, lymphomas, auto-immunities and cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
hematopoietic systems, expression of this gene at significantly higher or lower levels 
is routinely detected in certain tissues or cell types (e.g., immune, hematopoietic, or 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or ceil sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 145 as residues: Ser-22 to Ser-30, Pro-33 to Cys-48, 
Asp-50 to Lys-67, Pro-1 17 to Ser-130. Polynucleotides encoding said polypeptides 
are also provided. 
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The tissue distribution in eosinophils combined with its homology to a known 
human seven transmembrane domain protein indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
cancer and other proliferative disorders, particularly considering the fact that the 
majority of 7 transmembrane receptors are tightly associated with signal transduction 
pathways which axe integral to the modulation of the cell cycle. As such, the protein 
product of this gene may play a role in the regulation of cellular division, where loss 
of regulation may result in proliferating cells and the onset of tumors or cancer. 
Additionally, the expression in hematopoietic cells and tissues indicates that this 
protein may play a role in the proliferation, differentiation, and/or survival of 
hematopoietic cell lineages. In such an event, this gene is useful in the treatment of ' 
lymphoproliferative disorders, and in the maintenance and differentiation of various 
hematopoietic lineages from early hematopoietic stem and committed progenitor 
cells. Similarly, embryonic development also involves decisions involving cell 
differentiation and/or apoptosis in pattern formation. Thus this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Similarly, the tissue distribution and homology to CD97 indicates that the 
protein product of this gene might be a marker for differentiation and activation of 
eosinophils, and therefore is useful for the diagnosis and treatment of immune 
disorders including: leukemias, lymphomas, auto-immunities, immunodeficiencies 
(e.g., AIDS), immuno-supressive conditions (transplantation) and hematopoietic 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 11. 13, 14, 16. 18, 19, 20, and 27, and elsewhere 
herein. In addition this gene product is applicable in conditions of general microbial 
infection, inflammation or cancer. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
protein may show utility as a lumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:32 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3266 of SEQ ID NO:32, b is an 
integer of 15 to 3280, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:32, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

The translation product of this gene has been found to have homology to the' 
rat neural F box protein NFB42, in addition to a conserved Caenorhabditis elegans 
C14B1.3 protein (See, e.g., Genbank Accession Nos. gil3851648lgbfAAC97505.il 
(AF098301) and gi!558270; all references available through these accessions are 
hereby incorporated by reference herein). Preferred polypeptides of the invention 
comprise the following amino acid sequence: ALCPHPHLILNVTVSPAPSCRHVK 
KVVASPSPSTTMIAMDAPHSKAALDSINEIPENILI^ 

VCSLWRDLIDLMTLWKRKCLREGFITKDWT)QPVADWKIFYFLRSLHRNLLR 

NPCAEEDMFAWQIDFNGGDRWKVESLPGAHGTDFPDPKVKKYFVTSYEMCL 

KSQLVDLVAEGYWEELLDTFRPDIVVKDWFAARADCGCTYQLKVQLASA 

DYT^ASFEPPPVTIQQWNNATWTEVSYTFSDYPRGVRYILFQHGGRDTQY 

WAGWYGPR\TNSSIVVSPKMTRNQASSEAQPGQfCHGQEEAAQSPYRAVV 

QIF (SEQ ID NO: 302). Polynucleotides encoding these polypeptides are also 

provided. 

The gene encoding the disclosed cDN A is believed to reside on chromosome 
i. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1. 

This gene is expressed primarily in immune cells, especially primary dendritic 
cells, and T cells, and to a lesser extent in a variety of other tissues including breast, 
keratinocytes, epididiymus (cauda), lung, multiple sclerosis, endometrial stromal 
cells, IL4 induced umbilical vein endothelial cells, fetal kidney, fetal dura mater, 
rejected kidney, and osteoblasts. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and other proliferative disorders, particularly of the immune 
system or endothelial cells. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, or cancerous and wounded tissues) or bodily Quids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 146 as residues: Pro-41 to Cys-47, Phe-52 to Gly-59, 
Pro-62 to His -70. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of a 
variety of immune system disorders. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11, 13, 14. 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product in T-cells indicates a role in regulating the proliferation; survival; 
differentiation; and/or activation of hematopoietic cell lineages, including blood stem 
cells. This gene product is involved in the regulation of cytokine production, antigen 
presentation, or other processes suggesting a usefulness in the treatment of cancer 
(e.g., by boosting immune responses). 

Since the gene is expressed in ceils of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities. 
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such as T-ccll mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyeiination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or chat 
recruits hematopoietic ceils to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate igands or receptors, to identify agents 
that modulate their interactions and as nutritional supplements. It may also have a 
very wide range of biological activities although no evidence for any is provided in 
the specification. Typical of these are cytokine, ceil proliferation/differentiation 
modulating activity or induction of other cytokines; 

irnmunostimulating/immunosupprcssant activities (e.g., for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of haematopoiesis (e.g., for treating anaemia or as adjunct to 
chemotherapy); stimulation of growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g., for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, tumours); haemostatic or thrombolytic activity (e.g., for treating 
haemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g., for treating 
septic shock, Crohn's Disease); as antimicrobials; for treating psoriasis or other 
hyperproliferative disease; for regulation of metabolism, behaviour, and many others. 
Also contemplated is the use of the corresponding nucleic acid in gene therapy 
procedures. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 33 and may have been publicly available prior to conception of 
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the present invention Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1283 of SEQ ID NO:33, b is an 
integer of 15 to 1297, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:33. and where b is greater than or equal to a +14. 

FEATURES OF PROTEJN ENCODED BY GENE NO: 24 

The translation product of this gene shares sequence homology with the 
human, mouse, and bovine dopamine hydroxylase which is thought to be important in 
the modification of dopamine, a neurotransmitter (See Cenbank Accession Nos. 
gil30474, gil 162965, and/or gil2358082; all references available through these 
accessions are hereby incorporated by reference herein). Preferred polypeptides of the 
invention comprise the following amino acid sequence: RQRSWNPGT 

NCYHPNMPDAFLTCETVIFAWAIGGEGFSYPPHVGLSLCTPLX)PHYVLLEVH 

YDNPTYEEGLIDNSGLRLFYTMDIRKYDAGVIEAGLWVSLFHTIPPGMPEF 

QSEGHCrLECLEEALEAEKPSGIHVFAVLLlIAHLAGRGIRLRHFRKGICEMKL 

LAYDDDFDFNFQEFQYLKEEQTILPGDNLITECRYNTKDRAEMTWGGLSTR 

SEMCLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTTWPFIIKSPKQYK 

NLSFMDAMNKFKWTKKEGLSFNKLVLSLPVNVRCSKTDNAEWSIPRNDSrr 

SRYRKTL (SEQ ID NO: 303). Polynucleotides encoding these polypeptides are also 

provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MCCWPLLLLWGLLPGTAAGGSGRTYPHRTLLDSEGK 
YWLGWSQRGSQIAFRLQVRTAGYVGFGFSPTGAMASADIVVGGVAHGR 
PYLQDY FTNANRELK KDAQQDY HLEYAMENSTHTIIEFTRELHTCDINDKS 
iTDSTVRVI WAYHHE DAGEAGPKYHDSNRGTKSLRLLNPEKTSVLSTALPYF 

DLVNQDVPIPNKDTTYWCQMFKIPVFQEKHHVIKVEPVIQRGHESLVHHILL 
YQCSNNFNDSVPGIRARIAITPTCPMHSSPV KL(SEQ ID NO: 304). 
Polynucleotides encoding these polypeptides are also provided. 
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This gene is expressed primarily in brain, the pulmonary system, and to a 
lesser extent in kidney. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurological and behavioral disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the nervous system, expression 
of this gene at significantly higher or lower levels is routinely detected in certain 
tissues or cell types (e.g., neural, endocrine, or cancerous and wounded tissues) or* 
bodily fluids (e.g., sputum, lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 147 as residues: Ser-33 to Trp-38, Gly-40 to Gly-45, 
Asn-93 to Asp-105, Thr-128 to Thr-137, Glu-150 to Lys-167, Pro-197 to Tyr-203, 
Cys-242 to Asn-247, Ser-253 to Tyr-258. His-307 to GIu-3 14, GIu-357 to Gly-362, 
Trp-373 to Gln-378, Ser-402 to Glu-408. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in brain and homology to a protein involved in the 
modification of dopamine indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproiiferative 
Disorders" sections below, in Example 1 1, 15. and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourctte Syndrome, 
meningitis, encephalitis, demyeiinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
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compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, the gene or gene product may also play 
a role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, sexually- linked disorders, or disorders of the cardiovascular 
system. Alternatively, the homology 10 dopamine hydroxylase indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of 
the pancreas (e.g., diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g.. hyper-, hypothyroidism), parathyroid (e.g., hyper-' 
hypoparathyroidism), hypothalamus, and testes. Furthermore, the protein may also 
be used to determine biological activity, to raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as T antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 34 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between 1 to 2170 of SEQ ID NO:34, b is an 
integer of 1 5 to 2 1 84, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:34, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

When tested against Jurkat T-cell lines, supernatants removed from cells 
containing this gene activated the gamma activating sequence (GAS), a promoter 
element found upstream of many genes which are involved in the Jak-STAT pathway. 
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The Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
i o pathway, reflected by the binding of the GAS element, can be used to indicate 

proteins involved in the proliferation and differentiation of cells. Thus, it is likely that 
5 this gene activates T-cells through the JakStat signal transduction pathway. 

This gene is expressed in a variety of human normal and diseased tissues 
including breast, infant adrenal gland, skin tumor, colon, pituitary, Wilms tumor, and 
to a lesser extent in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential Identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, breast cancer and other proliferative disorders, afflicting endocrine or 
endothelial tissues/Similarly, polypeptides and antibodies directed to these 
25 polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endocrine system or of breast and/br breast lymph 
nodes, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., reproductive, endocrine, or cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
35 such a disorde »"> relative to the standard gene expression level, i.e., the expression 

level in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in breast, infant adrenal gland, skin tumor, colon, 
pituitary, and Wilm's tumor, and biological activity in activating the GAS promoter 
element indicates that polynucleotides and polypeptides corresponding to this gene 
arc useful for the detection, treatment, and/or prevention of various endocrine 
disorders and cancers, particularly Addison's Disease, Cushing's Syndrome, and 
disorders and/or cancers of the pancreas (e.g., diabetes mellitus), adrenal cortex, 
ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g., hyper-, 
hypothyroidism), parathyroid (e.g., hyper- hypoparathyroidism) f hypothalamus, and 
testes. Alternatively, the tissue distribution and biological activity indicates that 
50 polynucleotides and polypeptides corresponding to this gene are useful for the 
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diagnosis and treatment of cancer and other proliferative disorders. Expression within 
embryonic tissue and other cellular sources marked by proliferating cells, (i.e., breast, 
skin and Wilms tumors) indicates that this protein may play a role in the regulation of 
cellular division. Additionally, the expression in hematopoietic cells and tissues 
indicates that this protein may play a role in the proliferation, differentiation, and/or 
survival of hematopoietic cell lineages. Representative uses arc described in the 
"Inimurte Activity " and "infectious disease" sections below, in Example 11, 13, 14, 
16, 18, 19, 20, and 27, and elsewhere herein. In such an event, this gene is useful in 

the treatment of lymphoproliferative disorders, and in the maintenance and 

J" 

differentiation of various hematopoietic lineages from early hematopoietic stem and 
committed progenitor celts. Similarly, embryonic development also involves 
decisions involving cell differentiation and/or apoptosis in pattern formation! Thus 
this protein may also be involved in apoptosis or tissue differentiation and could again 
be useful in cancer therapy.Funherrnore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:35 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between i to 935 of SEQ ID NO:35, b is an 
integer of 15 to 949, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:35, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

In another embodiment, polypeptides comprising the amino acid sequence of 

the open reading frame upstream of the predicted signal peptide are contemplated by 

the present invention. Specifically, polypeptides of the invention comprise the 

5 following amino acid sequence: TGTFWSPRSQRRGCCGRRAPRPEAMENGAVYS 

PTTEEDPGPARGPRSGLAAYTFMGRLPLLRRVLKGLQLLLSLXAFICEEVVSQ 
CTLCGGLYTTEFVSCSAFLLSLLILW 

FLLAS IIFVSTI I DRTS AEIAA 1 VFGFIAS FMFLLDFITML YEKRQESQLRKPENTT 
RAEALTEPLNA (SEQ ID NO: 305). Polynucleotides encoding these polypeptides 
1 0 are also provided. 

Hie gene encoding the disclosed cDNA is believed to reside on chromosome 
3. Accordingly, polynucleotides related to this invention arc useful as a marker in 
linkage analysis for chromosome 3. 

This gene is expressed primarily in dendritic cells, and to a lesser extent in 
melanocytes, fetal liver and spleen and several other tissues. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
b:o!cg:cc! sample and for diagnosis of diseases and conditions which include, but are 
not limited to, inflammation, and disorders of the hepatic and immune systems. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the lissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and hematopoietic systems, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., hematopoietic, 
hepatic, immune, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 149 as residues: Phe-63 to Ser-75. Thr-97 to Ser- 102, 
Glu- 128 to Arg-143. Polynucleotides encoding said polypeptides are also provided. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections below, in Example 1 I T 13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product in 
dendritic cells indicates a role in the regulation of the proliferation; survival; 
differentiation; and/or activation of potentially all hematopoietic cell lineages, 
including blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes that may also suggest a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in fetal liver indicates that polynucleotides 
and polypeptides corresponding to this gene are useful for the detection and treatment 
of liver disorders and cancers (e.g., hepatoblastoma, jaundice, hepatitis, liver 
metabolic diseases and conditions that are attributable to the differentiation of 
hepatocytc progenitor cells). In addition the expression in fetus would suggest a 
useful role for the protein product in developmental abnormalities, fetal deficiencies, 
pre-natal disorders and various would-healing models and/or tissue trauma. 
Furthermore, the protein may also be used to determine biological activity, raise 
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antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 
5 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 36 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
10 cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3324 of SEQ ID NO: 36, b is an 
integer of 15 to 3338, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:36, and where b is greater than or equal to a + 14. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

In a specific embodiment, polypeptides comprising the amino acid sequence 
of the open reading frame upstream of the predicted signal peptide are contemplated 
by the present invention. Specifically, polypeptides of the invention comprise the 

20 following amino acid sequence: ASAPRVMRGHLAGFPALSGLASVCLWATFSA 
QLPGPVAATSWTPAPLGCSAARSGPEKRLGTAAPGSAASLAQAGPGAPCRV 
LPVDPAPAALNVREPGWLGGLFDGALXQVXLNFLRKSTDVLMDTREAESLEV 
E(SEQ ID NO: 306). 

In another embodiment polypeptides of the invention comprise the following 

25 amino acid sequence: NKLHSFPVFLSQLLLDRQLLHAPQTLPTPHCGGSSRPGP 
SHPPWLLIQLPCVHVALWQMLRDFSDSRITPSTLTTQPAAQTAAPAKDQES 
DIVGGEGILCD1AFLQEDHPLGVGGASAPSSRRELSRRGVHTQTLPEDGTLHG 
TPSSSFDCGIKYIISWPLAPGCDLPSLELSLVCKGVSSCMGFAAG (SEQ ID NO: 
307). Polynucleotides encoding these polypeptides are also provided. 

30 This gene is expressed primarily in endothelial cells, lung, and fetal kidney, 

and to a lesser extent in epididymis, keratinocytes and cerebellum. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
10 biological sample and for diagnosis of diseases and conditions which include, but are 

hot limited to, cardiovascular diseases involving endothelial cell disturbances such as 
5 atherosclerosis. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differentia] identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the cardiovascular system, expression of this gene at significantly 
higher or lower levels is routinely detected in certain tissues or cell types (e.g., 
10 cardiovascular, or cancerous and wounded tissues) or bodily fluids (e.g., lymph, 

serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
25 individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 150 as residues: Arg-47 to Leu-54. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in endothelial cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosing and treating 
20 disorders of endothelial cells such as atheroschlerosis, vasculitis, cardiovascular 
35 disease ' ani emphysema. The secreted protein can also be used to determine 

biological activity, to raise antibodies, as tissue markers, to isolate cognate ligandsor 
receptors, to identify agents that modulate their interactions and as nutritional 
supplements. The polypeptide may possess a wide range of undetected biological 
25 activities. Typical of these are cytokine, cell proliferauoa/diffcrentiation modulating 
activity or induction olother cytokines; immunostimulating/immunosuppressant 
activities (e.g.. for treating human immunodeficiency virus infection, cancer, 
autoimmunediseascs and allergy); regulation of haematopoiesis (e.g.. for 
treatinganaeiuia or as adjunct to chemotherapy); stimulation of growth of 
30 bone.carti.age. tendons, ligaments and/or nerves (e.g., for treating wou„d s . s ,imulation 
of follicle stimulating hormone (for control of fertility); chemotactic and 
chemokinetic activities (e.g., for treating infections, tumours); haemostatic or 
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thrombolytic activity (e.g., for treating haemophilia, cardiac infarction etc.); anti- 
inflammatory activity (e.g., for treating septic shock, Crohn's Disease); as 
10 antimicrobials; for treating psoriasis or other hyperproiiferative disease; for regulation 

of metabolism, behaviour, and many others. Also contemplated is the use of the 
5 corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:37 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
25 morc Polynucleotides comprising a nucleotide sequence described by the general 

15 formulaofa-b, where a is any integer between 1 to 1549 of SEQ ID NO:37, b is an 
integer of 1 5 to 1563, where both a and b correspond to the positions of nucleotide 
3Q rcsidues shown in SE Q ID NO;37. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

20 In another embodiment, polypeptides comprising the amino acid sequence of 

35 the °P en readin S frame upstream of the predicted signal peptide are contemplated by 

the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PGRPTRPTKNKVCVCLGMLFWAYPICVFIDSL 
SCQPCLWSTGATSHFNSPTTSPLFTLFMPCALAPNPFT QLGKLDDR (SEQ ID 
25 NO: 308). Polynucleotides encoding these polypeptides are also provided. 
This gene is expressed primarily in meningima. 

Therefore, polynucleotides and polypeptides of the invenuon are useful as 
reagents for differentia! identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
30 not limited to, tumors or disorders of the centra! nervous system. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing 
50 immunological probes for differential identification of the tissuefs) or cell type(s). For 
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a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g.. neural, or cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal 
5 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: I51 as residues: His-29 to Thr-34. Polynucleotides 
20 10 en c«ling said polypeptides are also provided. 

The tissue distribution in mcningima indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
15 "Hyperproliferative Disorders" sections below, in Example II, 15, and 18, and 
elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention or Alzheimer's Disease, Parkinson's Disease, 
K--ir.g-.or.'s Oisease. Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
20 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
35 mania - demenlia - Paranoia, obsessive compulsive disorder, depression, panic disorder. 

learning disabilities. ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception, as well as disorders of 
^ the meninges such as meningioma and meningitis. In addition, the gene or gene 

25 product may also play a role in the treatment and/or detection of developmental 
disorders associated with the developing embryo, sexually-liuked disorders, or 
disorders of the cardiovascular system. Furthermore, the protein may also be used to 
45 determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 

ligands or receptors, to identify agents that modulate their interactions, i„ audition to 
30 its use as a nutritional suppiemcnt. Protein, as well as. antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the ' 
50 above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:38 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, $there a is any integer between 1 to 1034 of SEQ ID NO:38, b is an 
integer of 15 to 1048, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO:38, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

The translation product of this gene has been shown to encode a human brain 
specific mitochondrial carrier (Genbank Accession No. gi!3 85 15401gblAAD04346.il 
1 5 (AF078544); all references available through this accession are hereby incorporated 
herein by reference) which shares sequence homology with the human body weight 
disorder associated gene C5 product which is known to be differentially expressed in 
obese compared to lean mice (See GeneSeq Accession No. R91281). Based on the 
sequence similarity, the translation product of this gene is expected to share at least 
10 some biological activities with mitochondrial carriers proteins. Such activities are 
known in the art, some of which are described in Sanchis et al, J. Biol. Chem. 
273:346 11 -346 15 (1998), incorporated herein by reference. 

Included in this invention as preferred domains are mitochondrial energy 
transfer protein (METP) domains, which were identified using the ProSite analysis 
15 tool (Swiss Institute of Bioinformatics). Structurally, members of the family of 

mitochondrial energy transfer proteins consist of three tandem repeats of a domain of 
approximately one hundred residues. Each of these domains contains two 
transmembrane regions. As a signature partem, we selected one of the most conserved 
regions in the repeated domain, located just after the first transmembrane region. To 
0 detect this widespread family of proteins, a consensus sequence was developed that 
contains the most conserved regions in the repeated domain. The consensus pattern is 
as follows: P.[DE].fLIVAT)[RlCJ.[LRH][LIVMFY][QMAIGV]. 
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Preferred polypeptides of the invention comprise the following amino acid 
sequences: PVDLTKTRLQ (SEQ ID NO: 309) and PTDVLK1RMQ (SEQ ID NO: 
» 3 10 >- Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the METP domains of the 
5 sequence listed above, and at least 5. 10. 15. 20, 25, 30. 50, or 75 additional 
1S contiguous amino acid residues of the sequence referenced in Table I for this gene. 

The additional contiguous amino acid residues is N-terminal or C- terminal to the 
METP domain. Alternatively, the additional contiguous amino acid residues is both 
N-terminal and C-terminal to the METP domain, wherein the total N- and C-terminal 
20 10 C0Mi ^ amino acid residues equal the specified number. The above preferred • 

polypeptide domain is characteristic of a signature specific to mitochondrial energy 
transfer proteins. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
X. Accordingly, polynucleotides related to this invention are useful as a marker in 
15 linkage analysis for chromosome X. 

This gene is expressed primarily in brain, and to a lesser extent, in T-cells. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type( s ) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
20 not limited to, neurological and behavioral disorders and immune disorders and/or 
35 0beSity - Simil ^V. Polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of (he 
«ssue(s) or cell ty P e(s). For a number of disorders of the above tissues or cells 
PartiCUlar,y ° f thC diS « eSlive - " e • ^ nervous systems, expression of this' gene 
25 at stgmficantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., immune, neural, or cancerous and wounded tissues) or bodily fluids (e.g 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
« Samp,c taken from a » ^dividual having such a disorder, relative to the standard gene 

expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
30 individual not havi n g the disorder. 
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Prcfeired polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 152 as residues: Gln-189 ,o Gly-195. Polynucleotides 
10 encoding said polypeptides are also provided. 

The tissue distribution in brain and homology to mitochondrial carrier proteins 
5 mdicates polynucleotides and polypeptides corresponding to this gene are useful for 
15 ^ dCteCtion ' treatmcnI - and/ 0' prevention of neurodegenerative disease sates. 

behavioral disorders, or inflammatory conditions. Representative uses are described in 
the "Regeneration - and "Hyperproliferative Disorders" sections below, in Example 
M , 15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to 

Disease, Huntington's Disease. Tourette Syndrome, meningitis, encephalitis, 
demyelinatmg diseases, peripheral neuropathies, neoplasia, trauma, congenital 
malformations, spinal cord injuries, ischemia and infarction, aneurysms, hemorrhages 
schuopnrenia, mania, dementia, paranoia, obsessive compulsive disorder, depression ' 
15 parac disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors ' 
including disorders in feeding, sleep patterns, balance, and perception. In addition, 
elevated expression of this gene product in regions of the brain indicates it plays a 
role is sonra! .-sura! function. 

Potentially, this gene product is involved in synapse formation. 
20 neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
35 SUrViVaL Funhe ™ore, the protein may also be used to determine biological activity 

to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents Aal modulate ((lejr imeraction$ jn addition ^ ^ ^ ^ ^ 

^ supplement. Protein, as well as. antibodies directed against the protein may show 

25 utihly as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:39 and may have been publicly available prior to conception of 
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30 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between I lo l4l6ofSEQrDNO:39,bisan 
integer of 15 to 1430. where both a and b correspond to the positions of nucleotide 
10 R5SidueS Sh0wn in SEO - ID NO-39, and where b is greater than or equal to a + 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: MTFGSTISPTSTHASPSLGFCCSWLLEDLEEQLYCSAFEEAALTR 
RICNrTSCWLPLDMELLHRQVLALQTQRVLLGMWLRRAWDTWVSPRRVAP 
GSRCLLTASHPClEpRKASAXQRNLGYPl^MLCLLVLTGLSVLIVAIHILEL 
LIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQVVLIFYLMVSSWGFYSSPLF 
RSLRPRWHDTANfTQHGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWL 
GNFY1VFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQAS 
RKTQHQ (SEQ ID NO: 31 . ) Po.ynuc.eot.des encoding such polypeptides are also 
25 provided. 

15 This gene is expressed primarily in immune system tissues (e.g. resting T- 

cells. primary dendritic cells, and neutrophils, apoptotic T-cells) and umbilical vein. 
This gene is expressed to a lesser extent in the gastrointestinal tissue (e.g. small 
inline, colon), brain (e.g. cerebellum, frontal cortex), aorta endothelial cells, skin 
tumor, embryonic tissue, thymus, and cancers (e.g. cheek, breast, synovial). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the ti.,sue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and immune disorders. Similarly, polypeptides and antibodies 
ducted to these polypeptides are useful in providing immunological probes for 
25 different .dentification of the tissue(s) or cell ty P e(s). For a number of disorders of 
the above tissues or cells, panicu.arly of the immune system and gastrointestinal tract 
expression of this gene at significantly higher or lower levels is routinely detected in 
« CCrtain tiSSUeS ° r CeH «W« (e g-, immune, gastrointestinal, cancerous and wounded 

tissues) or bodily fluids (e.g.. ara „io, ic , serum, plasma, urine, synovial fluid and 
30 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
50 . health " tissuc or bodi| y A"*" from an individual not having the disorder. 
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Preferred poiypeptides of the present invention comprise immunogenic 
epnopes shown i„ SEQ ID NO: ,53 as residues: Asp . 2 , to Scr . 29 Polynucieotjdes 
10 encod,n 8 said polypeptides are also provided. 

The tissue distribution in immune cells (e.g. T-cells. dendritic celJs 

Representee uses are descnoed in the "Immune Activity ^ , nfec(ious ^ 
s^ons tatow. in Example , 13 , 4 16> lg J9 2q ^ ^ 

prohferauon; ^ differentiation; and/or activation of hematopo.etic cel. - 

^ b ' 00d ^ Ce,1S " ^ gCnC ^ jS -'^in,he reguIationof 
overproduction, an , ig e„ presentation, or other processes suggesting . 
"> the treatment of cancer (e.g. by boosting immune responses). 

S,nce the gene is expressed i„ ce.ls of lymphoid origin, the natural gene 
produc, ,s mvo ,ved in i mmunc function , Tnerefoffi fc u ^ ^ ^ _ 

tT ' rheumatoid arthritis, granulomatous Disease, inflammatory 

stcT s ^ n' T* neUtr0Pe " ia - ^-^.ties. 
uch as T-cel. mediated cytotoxicity; immune reactions to transited organs and 

orders such as autoimmune inferti.ity, lense tissue injury, demyehnation, systemic 

e«„ofste m ^ 

* *» to mo, and cancerous , lSS ue (e.g. chee k . brcas, synovial sarcorna) indicaKs 
hat poiynuceottdes and po.ypep.des corresponding to this gene are user , for the 

*- Prorem may play a role m tne of ^.^^ ^ 
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expression in hematopoietic cells and tissues indicates that this protein may play a 
role in the proliferation, differentiation, and/or survival of hematopoietic cell Uncages 
1,1 SUCh CVent ' tWs is useft " » ** ^atment of lymphopro.ifera.ive disorders 
and .„ the maintenance and differentiation of various hematopoietic lineages from 
5 early hematopoietic stem and committed progenitor ceils. Similarly, embryonic 
^ development also involves decisions involving cell differentiation and/or apoptosis in 

pattern formation. Thus this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as 
antibodies directed against the protein may show utility as a tumor marker and/or 
20 10 ,mmi ™therapy targets for the above listed tissues. The tissue distribution in " 

cerebellum and frontal cortex indicates polynucleotides and polypeptides 
corresponding to this gene are ttsefu. for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditioas 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
1> Dtsorders" sections below, in Example 11,15, a „d . 8, and elsewhere herein Briefly 
the uses include, but ore not limited to the detection, treatment, and/or prevention of 
Altar's Disease. Parkinson's Disease, Huntington's Disease. Tourettc Syndrome, 
iHffl-gHa. encephalitis, demyelinating dtseases, peripheral neuropathies, neoplasia 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction 
20 aneurysms, hemorrhages, schizophrenia, mania, dementia. paranoia, obsessive 
35 compulsive disorder, depression, panic disorder, teaming disabilities. ALS 

psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
^ m regions of the brain indicates it plays a role in normal neural function. 

25 Potentially, this gene product is involved in synapse formation, 

neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survval. Furthermore, the protein may also be used to determine bio.ogical activity 
« t0 antibodies - ^ tissue markers, to isolate cognate ligands or receptors to 

•dentify agents mat modu.ate their interactions, in addition to its use as a nutritional 
30 supplement. Protein, as wcl. as, antibodies directed against the protein may show 
uuhty as a tumor marker and/or immunotherapy targets for the above Hsted tissues 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related t0 SEO - ,D m A0 and "»y h^e been publicly available prior to conception of 
*e present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
15 cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between l to 2089 of SEQ ID NO:40, b is an 
integer of 15 to 2103. whefe both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:40, and where b is greater than or equaJ to a + 14. '. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 31 
25 Thc P^Ptide of this gene has been determined to have a zinc finger ( Zinc finger, 

C2H2 type) domain at about amino acid position 16-50 of the amino acid sequence 
15 referenced in Table I for this gene. Therefore, 

A preferred polypeptide fragment of the invention comprises the following 
30 amino acid sequence: LCVCLVYLCMYGVCLCVIVCVSGVSLCLYVWGVSVC 

DCVSVFMCVCLCVIFCVYGKPRTEHYHSPHLAKQKAPREMCGRHDVSAAGIF 
QSYV (SEQ ID NO: 312). Polynucleotides encoding these polypeptides are also 
35 20 P rovided - 'Zinc Alger' domains are nucleic acid-binding protein structures first 

identified in the Xenopus transcription factor TF11IA. These domains have since been 
found in numerous nucleic acid-binding proteins. A zinc finger domain is composed 
of 25 to 30 amino-acid residues. There are two cysteine or histidine residues at both 
«, extremities of the domain, which are involved in the tetraherlral coordination of a zinc 

atom. It has been proposed that such a domain interacts with about five nucleotides. A 
schematic representation of a zinc finger domain is shown below: 
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Many classes of zinc fingers are characterized according to the number and 
positions of the histidine and cysteine residues involved in the zinc atom coordination. 
In the first class to be characterized, called C2H2. the first pair of zinc coordinating 
residues are cysteines, while the second pair are histidir.es. A number of experimental 
reports have demonstrated the zinc- dependent DNA or RNA binding property of 
some members of this class. Some of the proteins known to include C2H2-type zi nc 
fingers are listed below. We have indicated, between brackets, the number of zinc 
finger regions found in each of these proteins; a V symbol indicates that only partial 
sequence data is available and that additional finger domains is present. 
This gene is expressed primarily in salivary gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissues) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited lo, salivary gland related diseases, diseases of the mouth, and other 
digestive disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue^) or cell rype( S ). For a number of disorders of the above 
«, tissues or cells, particularly of the digestive system, expression of this gene at 

significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., saliva, serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
ep.topes shown in SEQ ID NO: .54 as residues: Gly-46 to His-54. Po.ynuc.eocidcs 
encoding said polypeptides are also provided. 

The tissue distribution indicates that the protein products of this gene are 
5 useful for diagnosis and treatment of salivary gland related diseases (mumps. ca I culi 
formation in ducts, sarcoidosis, facial palsy, tumors, Sjogren* Syndrome, and other 
Agesuve system disorders. Furthermore, the protein may also be used to determine 
b.olog 1C al activity, raise antibodies, as tissue markers, to isolate cognate hgands or 
receptor, to identify agents that modulate the.r interactions, in addition ,o its use as a 
10 nutritional supplement. Protein, as well as, antibodies directed against the proteiamay 
show utihty as a tumor marker and/or immunotherapy targets for the above listed' 



tissues. 



is 

one or 



Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
15 related to SEQ ID NO:4! and may have been publicly avai.able prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
cumbersome. Accordingly, preferably excluded from the present invention ate o. 
more polynucleotides comprising a nucleotide sequence described by the general 
20 formula of a-b, where a is any integer between 1 to 2335 of SEQ ID NO 4 1 b is an 
integer of 15 to 2349. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:41, and where b is greater than or equal ,o a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 
25 This gene is expressed primarily in fetal tissue (e.g. spleen, liver, brain) 

cancerous tissues (e.g. ovarian, colon, stomach, parathyroid) and to a lesser extent in 
— e cel.s and tissue (e.g. B-cel.s, T-ce,| s . bone marrow), and reproductive organ, 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell ,y pe(s ) presem in a 
30 b,olo gl cal sample and for diagnosis of diseases and conditions which include but are 
not hrmted to. cancer, particularly of .he colon and ovaries, disorders of the ' 
developing fetus, neurodegenerative conditions, and immune system disorders 
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Snarly, polypeptides and antibodies dieted to these poiypeptides m ^ in 
prov.dtng trnrnuno.ogica. probes for differentia! identification of the ,is Sue (s) or cel. 
«ype(s). For a number of disorders of the above tissues or ceils, particularly of the 
.mmune system> expression of ^ gene ^ sigmficamly ^ ^ ^ ^ ^ 

5 routmely detected in certain tissues or eel, types (e.g., rcproductive ^ 

cancerous and wounded tissues) or bodily fluids (e.g., serum. plasma> urine , ^ 
flutd and spinal fluid, or another tissue or cel. sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or.bodily fluid from an individual not having tncdisor£ler 
20 Preferred polypepudes of the present invention co.nprise immunogenic - 

epuopes shown in SEQ ID NO: ,55 as residues: Lys-35 to Lys^. Po.ynuCeotides' 
encoding said polypeptides are also provided. 

The expression of this gene within fetal tissue and other ceHuIar soutces marked by 
prohferating cells indicates this protein may p , ay a ro.e in the regulation of cellular 
divtston. and may show utility in the diagnosis, treatment, and/or prevention of 
developmental diseases and disorders, including cancer, and other proliferative 
condmon, Representative uses are described in the "Hyp.rpro.ifen.uve Disorders" 
a-"-d Regeneration" sections below and elsewhere herein. Briefly, developmental 
t,sues re.y on decisions involving cel. differentiation a.d/or apoptosis in pattern 
-JO formation. 

Dysregulation of apoptosis can result i„ i„ a p propriatc suppressjon Qf ^ 
*-U « occurs in the development of some cancers, or i„ failure to control the extent 
of cel. death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spina, m Uscular atrophy (SMA) ^ 
potential ro.es in proliferation and differentiation, this gene product may have 
apphcations in the adul, for tissue regeneration and the treatment of cancers It may 
also act as a morphogen to contro. ceil and tissue type specification. Therefore the 
polynucleotides and po.ypepndcs 0 f the present invents are useful in treats 
detect, and/or preventing said disorders and condition, in addition to others 
of generative conditions. Thus tins protein may modulate apoptos.s or tissue 
d.ffercmiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
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modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into .he regulation of cellular growth and proliferation. The tissue distribution 
in immune cells (such as T-cells and B-cells) and immune tissues (bone marrow) 
indicates polynucleotides and polypeptides corresponding to this gene are useful for 
me diagnosis and treatment of a variety of immune system disorders. Representative 
uses are described in the "Immune Activity" and "infectious disease" sections below, 
in Example 11,13, 14. 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including - 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bawl disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-celi mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility. le „ se tissue injury, demyelination. systemic- 
lupus erythematous, drug induced hemolytic anemia, rheumatoid arthritis. Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. The tissue distribution 
in parathyroid indicates polynucleotides and polypeptides corresponding to this gene 
are useful for the detection, treatment, and/or prevention of various endocrine 
disorders and cancers. Representative uses are described in the "Biological Activity", 
"Hyperproliferative Disorders", and "Binding Activity" sections helow. in Example 
1 1, 17, 18, 19, 20 and 27. and elsewhere herein. Briefly, the protein can be used for 
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the detection, treatment, and/or prevention of Addison's Disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
10 adrenal C0ITex - ovaries ' P ituitarv hyper-, hypopituitarism), thyroid (e.g. hypw-. 

hypothyroidism), parathyroid (e.g. hyper-.hypoparathyroidism) . hypothallamus. and 
5 testes. Additionally, the tissue distribution in brain tissue indicates polynucleotides 
K " nd WmPltaa corresponding to this gene are useful for the detection, treatment, 

and/or prevention of neurodegenerative disease states, behavioral disorders, or 
inflammatory conditions. Representative uses are described in the "Regeneration" and 
yperp " erative Disorders sections below, in Example ll, 15, and 18 and 
20 10 elsewf >ere herein. Briefly, the uses include, but arc not limited to the detection. - 

treatment, and/or prevention of Alzheimer's Disease. Parkinson s Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spiral 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities. ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation. 
35 neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 

survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
^ identify agents that modulate their interactions, in addition to its use as a nutritional 

25 supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 
45 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:42 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 



25 

15 



30 



20 



30 



50 



55 



WO 00/06698 PCTAJS99/I7U0 
5 9t 

more polynucleotides comprising a nucleotide sequence described by - the general 
formula of a-b, where a is any integer between I to 1545 of SEQ ID NO:42, b is an 
10 integer of 1 5 t0 1 559 ' where both a and b correspond to the positions of nucleotide 

residues shown in SEQ ID NO:42, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

When tested against U937 Myeloid cell lines, supernatant* removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells through t£e Jak-STAT signal transduction pathway. The gamma 
20 10 ^^ng sequence (GAS) is a promoter element found upstream of many genes- 

which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large,' 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and 
1 5 differentiation of cells. 

This gene is expressed primarily in skin tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents fcr differentia! identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skin disorders, paniculary skin cancer. Similarly, polypeptides and 
35 antibodies directed to these polypeptides are useful in providing immunological 

probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the skin, expression of this gene 
^ at significandy higher or lower levels is routinely detected in certain tissues or cell 

25 types (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e„ 
45 thc session level in healthy tissue or bodily fluid from an individual not having the 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 156 as residues: Pro-38 to Gly-44, Phe-56 toThr-64. 
Polynucleotides encoding said polypeptides are also provided. 
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The tissue distribution in skin indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment, diagnosis, and/or prevention 
of various skin disorders including congenital disorders (i.e. nevi, moles, freckles, 
Mongolian spots, hemangiomas, port- wine syndrome), integumentary tumors (i.e. 
keratoses, Bo wen's Disease, basal cell carcinoma, squamous cell carcinoma, 
malignant melanoma, Paget s Disease, mycosis fungoides, and Kaposi's sarcoma), 
injuries and inflammation of the skin (i.e.wounds, rashes, prickly heat disorder, 
psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, autoimmune 
disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma, 
pemphigoid, and pemphigus), keloids, striae, erythema, petechias purpura, and 
xanthelasma. Moreover, such disorders may predispose increased susceptibility to 
viral and bacterial infections of the skin (i.e. cold sores, warts, chickenpox, 
molluscum contagiosurn, herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, 
althletes foot, and ringworm). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and immunotherapy targets for the above 
listed tumors and tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:43 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1752 of SEQ ID NO:43 t b is an 
integer of 15 to 1766, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:43. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

The translation product of this gene shares sequence homology with mitogen- 
induced prostate carcinoma (mouse) which is thought to be important in the etiology 
of cancer. In this respect, this gene is mitogen-induced anaVor involved in cell 
proliferation. 
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Preferred polypeptides of the invention comprise the following amino acid 
sequence: GHMPYGWLTEntAVTPAFDKNNPSNKLVSTSMVTAAHIKKF 
TFVCMALSLTLCFVMFVVTPNVSEKILIDnGVDFAFAELCVWLRIFSI^PVPVT 
VRAHLTGWLMTLKKTFVLAPSSVLRIIVLMSLVVLPYLGVHGATLGV 
5 GFVGESTMVAIAACYVTRKQKKK 

MREENE (SEQ ID NO: 3 13) and/or QVVFVAILLHSHLECREPLLIPELSLYMGA 
LVRCTTLCLGYYKNmDIIPDRSGPELGGDATIRKMLSFWWPLAL^ 
PIVNLFVSRDLGGSSAATEAVAJLTATYPV (SEQ ID NO: 314). Polynucleotides 

encoding these polypeptides are also provided. 

■ ^ $ 

20 ' 115(5 gene cnc «Knfi the disclosed cDN A is believed to reside on chromosome. 

5. Accordingly, polynucleotides related to this invention are useful as a marker in * 
linkage analysis for chromosome 5. 

This gene is expressed primarily in early infant and adult brain, retina, fetal 
tissue (e.g., liver, speen, whole embryo) and to a lesser extent in immune cells (e.g., 
15 monocytes and T-ceUs), colon, and parathyroid tumor tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to 7 cancers, disorders of the immune system and nervous system. 
20 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
35 providing immunological probes for differential identification of the tissue(s) or cell 

typc(s). For a number of disorders of the above tissues or cells, particularly of the 
metabolic system (cancers), expression of this gene at significantly higher or lower 

levels is routinely detected in certain tissues or cell types (e.g., immune, neural 
40 A f * 

25 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

fluid and spinal fluid) or another tissue or cell sample taken from an individual having 

such a disorder, relative to the standard gene expression level, i.e., the expression 

45 Ievcl in heakhv tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 

30 epitopes shown in SEQ ID NO: 157 as residues: Arg-122 to Ser-139, Mel- 144 to Glu- 

149. Polynucleotides encoding said polypeptides are also provided. 

50 
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The tissue distribution and homology to mitogen induced prostate carcinoma 
(mouse) indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the study and treatment of cancers, including but not limited to the 
colon, parathyroid, and adrenal glands. Moreover, the expression within fetal tissue 
and other cellular sources marked by proliferating cells indicates this protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis, 
treatment, and/or prevention of developmental diseases and disorders, including 
cancer, and other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
a!so act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. The tissue distribution 
in immune cells (T-cells, monocytes) indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections below, in Example 11,13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
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activation of hematopoietic cell lineages, including blood stem ceils. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
10 other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 

immune responses). 

5 Since the gene is expressed in cells of lymphoid origin, the natural gene 

product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and grafi-versus-host diseases, or autoimmunity ' 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
25 Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 

15 influences the differentiation or behavior of other blood cells, or that recruits 

hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
3Q mc expansion of stem cells and committed progenitors of various blood lineages, and 

in the differentiation anoVor proliferation of various cell types. The tissue distribution 
in brain indicates polynucleotides and polypeptides corresponding to this gene are 
20 useful for the detection, treatment, and/or prevention of neurodegenerative disease 
35 slatcs ' behavioral disorders, or inflammatory conditions. Representative uses are 

described in the "Regeneration" and "Hyperproliferative Disorders" sections below, in 
Example II, 15, and 18, and elsewhere herein. Briefly, the uses include, but are not 
limited to the detection, treatment, and/or prevention of Alzheimer's Disease. 
40 25 Parkinson's Disease, Huntington's Disease, Tourette Syndrome, meningitis, 

encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 
45 hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 

disorder, depression, panic disorder, learning disabilities. ALS, psychoses, autism,, 
30 and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
perception. In addition, elevated expression of this gene product in regions of the 
50 brain indicates it plays a role in normal neural function. 
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Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate Iigands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:44 and may have been publicly available prior to conception pf 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2558 of SEQ ID NO:44, b is an 
integer of 15 to 2572, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:44, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED B V GENE NO: 35 

This gene is expressed primarily in adult pulmonary tissue, umbilical vein, 
prostate, and fetal tissue (e.g.,heart). 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases of the pulmonary system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the pulmonary system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., pulmonary, cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
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standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 158 as residues: Arg-45 to GIy-5 1 , Glu-75 to Asn-81. 
Polynucleotides encoding said polypeptides arc also provided. 

The tissue distribution in pulmonary tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the detection and treatment of 
disorders associated with developing lungs, particularly in premature infants where 
the lungs arc the last tissues to develop. Additionally, the tissue distribution indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and intervention of lung nimors, since the gene is involved in the regulation 
of cell division, particularly since it is expressed in fetal tissue. Moreover, the 
expression within fetal tissue and other cellular sources marked by proliferating cells 
indicates this protein may play a role in the regulation of cellular division, and may 
show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperproliferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications iu the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
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proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:45 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of ah, where a is any integer between 1 to 512 of SEQ ID NO:45, b is an 
integer of 15 to 526, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:45, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

This gene is expressed primarily in adipose tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, fat metabolism. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the metabolic system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 



WO 00/06698 



99 



PCT/US99/I7I30 



the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 159 as residues: Pro-96 to Ser-106. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in adipose tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment of obesity and 
other metabolic and endocrine conditions or disorders. Furthermore, the protein 
product of this gene may show utility in ameliorating conditions which occur 
secondary to aberrant fatty-acid metabolism (e.g. aberrant myelin sheath 
development), either directly or indirectly. Protein, as well as, antibodies directed * 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:46 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1018 of SEQ ID NO:46, b is an 
integer of 15 to 1032, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:46, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

This gene is expressed primarily in adult brain tissue, testes, placenta, kidney, 
infant and fetal tissue (e.g. ,liver, spleen, lung) and to a lesser extent in immune cells 
(e.g.,T-ce!ls and neutrophils) and in cancerous tissues (e.g., ovarian tumor, Hodgekins 
lymphoma, pancreas, T-cell). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to, CNS disorders, disorders of the testicles, cancer, particularly ovarian, 
pancreatic, T-ccIL and Hodgekins lymphoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain,CNS, and testes expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
ceil types (e.g.. neural urogenital, cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 11,15, and 18, and elsewhere herein. Briefly, 
the uses include, but are noL limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease. Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Moreover, the expression within fetal tissue and other cellular sources 
marked by proliferating cells indicates this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
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and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in partem 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, ' 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. The tissue distribution 
indicates polynucleotides and polypeptides corresponding to this gene are useful for 
the diagnosis and treatment of a variety of immune system disorders. Representative 
uses are described in the "Immune Activity" and "infectious disease" sections below, 
in Example 11,13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene producL is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
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such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Additionally, the 
tissue distribution in testes indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of conditions* 
concerning proper testicular function (e.g. endocrine function, sperm maturation), as 
well as cancer. Therefore, this gene product is useful in the treatment of male 
infertility and/or impotence. This gene product is also useful in assays designed to 
identify binding agents, as such agents (antagonists) are useful as male contraceptive 
agents. Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that is expressed, particularly at low levels, in other tissues of the body. Therefore, 
this gene product is expressed in other specific tissues or organs where it may play 
related functional roles in olher processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. 
Furthermore, the protein may also be used to determine biological activity, raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:47 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
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more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2666 ofSEQ ID NO:47, b is an 
integer of 1 5 to 2680, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:47 ( and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

When tested against fibroblast cell lines, supernatants removed from cells 
containing this gene activated the EGR i assay. Thus, it is likely that this gene 
activates fibroblast cells through a signal transduction pathway. Early growth 
response I (EGR1) is a promoter associated with certain genes that induces various 
tissues and cell types upon activation, leading the cells to undergo differentiation and 
proliferation. 

This gene is expressed primarily in endometrial stromal cells, endometrial 
tumors, keratinocytes, fetal tissue (e.g. liver, spleen) and to a lesser extent in 
endothelial cells and immune cells (e.g., T-cells). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typefs) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, endometrial carcinoma and immune cells disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the female 
reproductive system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., immune, cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial Quid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in the endometrium indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for treating female infertility. The 
protein product is likely involved in preparation of the endometrium of implantation 
and could be administered either topically or orally. Alternatively, this gene could be 
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transfected in gene-replacement treatments into the cells of the endometrium and the 
protein products could be produced. Similarly, these treatments could be performed 
during artificial insemination for the purpose of increasing the likelyhood of 
implantation and development of a healthy embryo. In both cases this gene or its gene 
product could be administered at later stages of pregnancy to promote heathy 
development of the endometrium. Additionally, polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of endometrial 
carcinoma. The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene arc useful for the diagnosis and treatment of cancer and 
other proliferative disorders. Expression within embryonic tissue and other cellular 
sources marked by proliferating cells indicates that this protein may play a role in the' 
regulation of cellular division. Additionally, the expression in hematopoietic cells and 
tissues indicates that this protein may play a role in the proliferation, differentiation, 
and/or survival of hematopoietic cell lineages. In such an event, this gene is useful in 
the treatment of iymphoproliferative disorders, and in the maintenance and 
differentiation of various hematopoietic lineages from early hematopoietic stem and 
commiued progenitor cells. Similarly, embryonic development also involves 
decisions involving cell differentiation and/or apoptosis in pattern formation. Thus 
this protein may also be involved in apoptosis or tissue differentiation and could again 
be useful in cancer therapy. The tissue distribution in immune cells such as helper T- 
cells indicates polynucleotides and polypeptides corresponding to this gene are useful 
for the diagnosis and treatment of a variety of immune system disorders. 
Representative uses are described in the "Immune Activity" and "infectious disease" 
sections below, in Example 1 1, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. 
Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoieticcell 
lineages, including hlood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
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as AIDS, leukemia, rheumatoid arthritis, granulomatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lensc tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. The tissue distribution 
in keratinocytes indicates polynucleotides and polypeptides corresponding to this 
gene are useful for the treatment, diagnosis, and/or prevention of various skin 
disorders. Representative uses are described in the "Biological Activity", 
"Hyperproliferative Disorders", "infectious disease", and "Regeneration" sections 
below, in Example 1 1, 19, and 20, and elsewhere herein. Briefly, the protein is useful 
in detecting, treating, and/or preventing congenital disorders (i.e. nevi, moles, 
freckles, Mongolian spots, hemangiomas, port-wine syndrome), integumentary 
tumors (i.e. keratoses. Bowen's Disease, basal cell carcinoma, squamous cell 
carcinoma, malignant melanoma, Paget's Disease, mycosis fungoides, and Kaposi's 
sarcoma), injuries and inflammation of the skin (i.e. wounds, rashes, prickly heat 
disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, 
autoimmune disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, 
scleroderma, pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, 
purpura, and xanthelasma. In addition, such disorders may predispose increased 
susceptibility to viral and bacterial infections of the skin (i.e. cold sores, warts, 
chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, erysipelas, 
impetigo, tinea, althlctes foot, and ringworm). Moreover, the protein product of this 
gene may also be useful for the treatment or diagnosis of various connective tissue 
disorders (i.e.. arthritis, trauma, tendonitis, chrondomalacia and inflammation, etc.), 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 
dermatomyositis, etc.), dwarfism, spinal deformation, joint abnormalities, amd 
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chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. - 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:48 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1716 of SEQ ID NO:48, b is an 
integer of 15 to 1730, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:48, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

This gene is expressed primarily in LNCAP cells (prostate cell line) and retina 
derived N2b5HR cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, prostate cancer and eye disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the male urogenital and 
reproductive system expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
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to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 162 as residues: Asn-50 to Ser-57. Polynucleotides 
encoding said polypeptides are also provided. 

The expression in prostate may indicate the gene or its products can be used in the 
disorders of the prostate, including inflammatory disorders, such as chronic 
prostatitis, granulomatous prostatitis and maiacoplakia, prostatic hyperplasia and 
prostate neoplastic disorders, including adenocarcinoma, transitional cell carcinomas, 
ductal carcinomas, squamous cell carcinomas, or as hormones or factors with 
systemic or reproductive functions. The tissue distribution in retina indicates that - 
polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment and/or detection of eye disorders including blindness, color blindness, 
impaired vision, short and long sightedness, retinitis pigmentosa, retinitis proliferans, 
and retinoblastoma, retinochoroiditis, retinopathy and retinoschisis. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO:49 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1261 of SEQ ID NO:49, b is an 
integer of 15 to 1275, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:49. and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: RCCCRGCSCRARLCPPARSTAVAPECRGAHPSR 
AMRPGTAIX^AVIXAVLLVGLRAATC 

TSRRLNFEEAKEACRRGWRPASQHRVLKMNRN (SEQ ID NO: 315). 
Polynucleotides encoding these polypeptides are also provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MRPGTALQAVLLAVLLVGLRAATGRLLSGQPVCRGG 
TQRPCYKVIYraDTSRRLNFEEAKEACRRGWRPASQHRVLKMNRN (SEQ ID 
NO: 316). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in smooth muscle and human thyroid and to a 
lesser extent in amniotic cells and human endometrial stromal cells-treated with 
progesterone. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, thyroid disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissuc(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endocrine system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 163 as residues: Ser-75 to Leu-8 1 . Polynucleotides 
encoding said polypeptides are also provided. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of endocrine 
disorders of the thyroid. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO: 50 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 748 of SEQ ID NO:50, b is an 
integer of 15 to 1762, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:50, and where b is greater than or equaj to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in human testes tumor and bone marrow. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the testicles including but not limited to testicular 
cancerand immune system disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the male reproductive system and immune system 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, immune, cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ fD NO: 164 as residues: His-31 to GIy-41. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in testes, particularly testicular tumors, indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment and diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents. Similarly, the protein is believed to be useful 
in the treatment and/or diagnosis of testicular cancer. The testes are also a site of 
active gene expression of transcripts that is expressed, particularly at low levels, in 
other tissues of the body. Therefore, this gene product is expressed in other specific 
25 dssucs or or 8 ans wnere il ma Y P^y related functional roles in other processes, such as 

hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. The tissue distribution in bone marrow indicates 
polynucleotides and polypeptides corresponding to this gene arc useful for the 
diagnosis and treatment of a variety of immune system disorders. Representative uses 
are described in the "Immune Activity" and "infectious disease" sections below, in 
Example H, 13, 14 t 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
35 expression of this gene product indicates a role in regulating the proliferation; 

survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus -host diseases, or autoimmunity 
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disorders, such as autoimmune infertility, iense tissue injury, demyclination, systemic 
lupus erythematosis. drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate I igands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or ' 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:5 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2045 of SEQ ID NO:51 , b is an 
integer of 15 to 2059, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:5 1 , and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

The translation product of this gene shares sequence homology with 
proLocadherins, which are related to cadherin, and possess cell adhesive ability. 
Cadherins are glycosylated integral membrane proteins that are involved in cell-cell 
adhesion. 

This gene is expressed primarily in brain (infant, adult frontal lobe, manic 
depression tissue) and to a lesser extent in epididymus, healing groin wounds, ovary, 
adipocytes, and fetal tissue (e.g., kidney and retina). 
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Therefore, polynucleotides and polypeptides of the invention an, useful as 
reagents for differentia! identification of the tissue(s) or cell ,ype(s) present in a 
'° b,0l ° 8 ' Cal Samp ' e ^ for di W« °' ""e^es and conditions which include but are 

not hmtted to. neurodegenerative disorders, impaired male and female fertility 
, aeve.opmenta. disorder,, fibrosis, and manic deptesston . Sitnilar.y. po.ypeptides and 
anubod.es directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissues) or cel. «ype ($) . For a numbcr of 
disorders of the above tissues or cells, particularly of the nervous system and 
reproductive system expression of this gene a, significant^ higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., neural, reproductive, cancerous 
and wounded tissues) or bodi.y fluids (e.g.. serum, p.asma. urine, synovia! fluid and 
sp.na. flmd) or another tissue or cell sample taJcen f rom a „ mdividual having such a 
d.sorder, relative to the standard gene expression level, i.e., the expression level i„ 
25 hea ' thy ,isSUC or bodi «y ""id from an individual not having the disorder. 

15 PrefeTOd Peptides of the present invendon comprise immunogenic 

epttopes shown in SEQ ID NO: 165 as residues: Val-35 to Lys-4 1, Ser-68 to 01*73 
30 ° 1U - 88 t0 G,U - 93 ' M S- 156 to G| y A.a-199 to G.y-206. Asp-2 16 to Ser-226 Thr- 

249 to Asn-254. Ah>-339 to Pro-345, rie-370 to Gly-379. Pro-429 to Glu-434 Axg 
46. to Pro-466. Ala-475 toThr-482, Pro-585 to G.y-593. Glu-631 to Gh^39 Pro- 
20 674 to Pro-682, G,„-7,5 to G.y-720. Ser-736 to Arg-742. Polynucleotides encoding 
35 said Polypeptides are also provided. 

BLAST analysis reveals high homology to protocadherin sequences These 
sequences are related to cadherin, and possess cel. adhesive ability. Such proteins 
may have regulatory functions in the cel., as well as the cell-cell adhesive properties 
Anfbodies produced against these sequences are useful for modulating the binding ' 
actmiy of these protocadherins. and can be used therapeutically. The tissue 
Cstnbution in brain indicates polynucleotides and polypeptides corresponding to this 
gene are useful for the detection, treatment, and/or prevention of neurodegenerative 
dtsease states, behavioral disorder, or inflammatory conditions. Representative uses 
30 are described in the "Regeneration" and "Hypcrproliferauve Disorders" sections 

below, in Example I.. .5. and .8, and e.sewhere herein. Briefly, the uses indude. but 
50 are not limited to the detection, treatment, and/or prevention of Alzheimer's Disease 
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Parkinson's Disease, Huntington's Disease, Tourette Syndrome, meningitis, 
encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 
hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 
disorder, depression, panic disorder, learning disabilities, ALS, psychoses, autism, 
and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
perception. In addition, elevated expression of this gene product in regions of the 
brain indicates it plays a role in norma! neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in epididymis indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
25 maturation), as well as cancer. Therefore, this gene product is useful in the treatment 

of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that is expressed, particularly at low levels, in other tissues 
of the body. Therefore, this gene product is expressed in other specific tissues or 
35 0rgans where h ma y P lav relat ed functional roles in odier processes, such as 

hematopoiesis, inflammation, bone formauon, and kidney function, to name a few 
possible target indications. Moreover, the expression within fetal tissue (e.g., kidney 
and retina) and other cellular sources marked by proliferating cells indicates this 
protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis, treatment, and/or prevention of developmental diseases and disorders, 
including blindness, cancer, and other proliferative conditions. Representative uses 
are described in the "Hyperproliferativc Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
30 involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
50 death, as occurs in the development of some cancers, or in failure to control the extent 



20 



40 25 



45 



55 



WO 00/06698 



114 



PCT/US99/I7U0 



10 



15 



of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also aci as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:52 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3268 of SEQ ID NO:52, b is an 
integer of 1 5 to 3282. where hoth a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:52, and where b is greater than or equal to a + 14. 

30 FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: 
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IRHEQQGEEDDEHARPLAESLLLAIADLLFCPDFTVQSHRRSTVDSAEDVHSL 
DSCEYIWEAGVGFAHSPQPNYIHDMNRMELLKLLLTCFSEAMYLPPAPESGS 
TNPWVQFFCSTENRHALPLFTSLLNTVCAYDPVGYGIPYNHLLESDYREPLVE 
EAAQVLIVTLDHDSASSASPTVDGTTTGTAMDDADPPGPENLFVNYLSRIHRE 
EDFQFILKGIARLLSNPLLQTYLPNSTKKDPVPPGAASSLLEALRLQQEIPLLRA 
EEQRRTRHPCPHPLLPQRCPGRSV (SEQ ID NO: 317). Polynucleotides encoding 
such polypeptides are also provided. 

This gene is expressed primarily in brain, breast, breast cancer tissue and to a 
lesser extent in epididymis., amniotic cells, and embryo tissue. 
20 10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a ' 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurodegenerative disorders, impaired CNS function, male sterility, 
and breast cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell typc(s). For a number of disorders of the above 
tissues or cells, particularly of the nervous and reproductive systems, expression of 
this gene at significantly higher or lower levels is routinely detected in certain tissues 
or cell types (e.g.. neural, male reproductive, cancerous and wounded tissues) or 
20 bodily fluids (e.g., amniotic, serum, plasma, urine, synovial fluid and spinal fluid) or 
35 « Mlter tis 'ue or cell sample taken from an individual having such a disorder, relative 

to the standard gene expression level, i.e.. the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 166 as residues: Pro-22 to Pro-3 1 , Ser-38 to His-13, 
Asp-74 to Leu-79. Asp-1 13 to Glu-121. Leu-157 to Val-166, AIa-189 to Arg-196. ' 
Gln-206 to Arg-21 1. Polynucleotides encoding said polypeptides are also provided. 
45 The tissuc distribution in brain, paniculary in the cerebellum, indicates 

polynucleotides and polypeptides corresponding to this gene are useful for the 
30 detection, treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions. Representative uses are described in the 
50 . "Regeneration" and "Hyperproliferati ve Disorders" sections below, in Example 1 1 . 
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15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
10 Huntington^ Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 

diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
5 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
10 normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in epididymus indicates that polynucleotides and 
25 polypeptides corresponding to this gene are useful for the treatment and diagnosis of 

15 conditions concerning proper testicular function (e.g. endocrine function, sperm 

maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
20 and/or diagnosis of testicular cancer. The testes are also a site of active gene 
35 expression of transcripts that is expressed, particularly at low levels, in other tissues 

of the body. Therefore, this gene product is expressed in other specific tissues or 
organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
25 possible target indications. The expression in the breast tissue may indicate its uses in 
the diagnosis and/or treatment of breast neoplasia and breast cancers, such as 
fibroadenoma, pipillary carcinoma, ductal carcinoma, Paget's Disease, medullary 
45 carcinoma, mucinous carcinoma, tubular carcinoma, secretory carcinoma and 

apocrine carcinoma, as well as juvenile hypertrophy and gynecomastia, mastitis and 
30 abscess, duct ectasia, fat necrosis and fibrocystic diseases. Moreover, the expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
50 indicates this protein may play a role in the regulation of cellular division, and may 
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show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperproliferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysrcgulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA), Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
delecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate Ugands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ iD NO: 53 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between 1 to 1 846 of SEQ ID NO:53, b is an 
integer of 15 to 1 860, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO:53, and where b is greater than or equal to a + 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

Contact of cells with supernatant expressing the product of this gene increases 
the permeability of monocytes to calcium. Thus, it is likely that the product of this 
gene is involved in a signal transduction pathway that is initiated when the product of 
this gene binds a receptor on the surface of the monocyte cell. Thus, polynucleotides 
and polypeptides have uses which include, but arc not limited to, activating monocyte 
cells. 

This gene is expressed primarily in CD34 positive cells derived from human 
cord blood. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell typc(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoietic disorders; immune dysfunction; defects in hematopoietic 
stem and progenitor cells; susceptibility to chemotherapy and irradiation. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
35 a numbcr of disorders of the above tissues or cells, particularly of the immune system, 

expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e.. the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 167 as residues: Ala-38 to Leu-59, Ala-63 to Thr-71, 
Lys-82 to Leu-91, Glu-97 to Ser-107, Gln-143 to Ala-149, Ile-153 to Leu-158, Ser- 
169 to Arg-182. Polynucleotides encoding said polypeptides are also provided. 
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Elevated expression of this gene product in CD34 positive hematopoietic cells 
indicates that it is expressed by early stem and progenitor cells of the hematopoietic 
10 lineages. Therefore, this may represent a soluble factor that is able to control the 

survival, proliferation, differentiation, or activation of all hematopoietic lineages, 
5 including stem and progenitor cells. Thus, it could be quite useful, for example, in ex 
vivo expansion of stem cell numbers for hematopoietic disorders or for cancer 
15 patients. Alternately, it may represent a factor that influences the hematopoietic 

microenvironment by affecting stromal cells that release other factors required for 
hematopoietic development Additionally, the tissue distribution in CD34 positive 
2Q 10 cclIs also indicates polynucleotides and polypeptides corresponding to this gene are . 

useful for the treatment and diagnosis of hematopoietic related disorders such as 
anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells 
are important in the production of cells of hematopoietic lineages. Representative uses 
25 are described in the "Immune Activity" and "infectious disease" sections below, in 

15 Example 1 1, 13, 14, 16, 18, 19. 20, and 27, and elsewhere herein. Briefly, the uses 

include bone marrow cell ex-vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. 

The gene product may also be involved in lymphopoiesis, therefore, it can be 
used in immune disorders such as infection, inflammation, allergy, immunodeficiency 
etc. In addition, this gene product may have commercial utility in the expansion of 
35 stem ceIls and committed progenitors of various blood lineages, and in the 

differentiation and/or proliferation of various cell types. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:54 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded rrom the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
30 formula of a-b, where a is any integer between 1 to 756 of SEQ ID NO:54, b is an 
integer of 15 to 770, where both a and b correspond to the positions of nucleotide 
50 residues shown in SEQ ID NO:54, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

10 ™ S gCne is CX P nssei P rimari "y breast and 12-week old human embryos 

and to a lesser extent in stomach cancer and liver. 

5 Therefore, polynucleotides and polypeptides of the invention are oseful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
b.o.ogicaj sampl. and for diagnosis of diseases and conditions which include, bu, are 
no, hrrated to, breast cancer; stomach cancer, embryonic defects; hepatic disorders 
Smularly, polypeptides and anybodies directed to these polypeptides are useful i„ 
proving immunological probes for differentia, identification of the ti S sue(s) or cel. 

type*). For , number of of the above t . ssues Qr ccik panicui ^ ^ 

d,ges,,ve and endocrine systems, expression of this gene at significantly higher or 
lower levels is routinely detected in cenain tissues or ce.l types (e.g., cancerous and 
wounded tissues) or bodi.y fluids (e.g., serum, plasma, urine, synovia, fluid and spinal 
flu.d) or another tissue or cel. samp.e taken from an individua, hav ing such a disorder 
revive to the standard gene expression level, i.e.. the expression level in healthy ' 
t,ssue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates mat the protein products of this gene are 
useful forthe diagnosis and/or treatment of a variety of disorders. Elevated expression 
of ,„,s gene product in stomach cancer indicates it is useful as a marker or therapeutic 
t3rgCt fM St ° maCh CanCCr - Ato »-'* «ion in breast tissue is influenced by , he 
presence or absence of breast cancer tissue, and may thus a .so serve as a diagnostic 
marker for this cancer as we... Expression in the developing embryo may correlate 
wuh the normal development of human embryhos. and expression m the liver is 
mvolved in the relation of norma, liver function and/or .iver regeneration 
Many polynucleotide sequences, such as EST sequences, are publicly 
ava,.ab.e and accessib.e through sequence databases. Some of these sequences are 
related to SBQ ,D NO:55 and may have been publicly available prior .conception of 
the present invention. Preferably, such related po.ynuc.eo.ides are specified* 
excluded from the scope of the present invention. To list every re.ated sequence ,s 
cumbersome. Accord.g.y, prefe rab. y excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-tx where a is any integer between I to 1079 of SEQ ID NO:55, b is an 
integer of 15 to 1093. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 55, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed primarily in human hypothalamus derived from a 
patient with schizophrenia. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential, identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, bu.t are 
not limited to. schizophrenia; neurological disorders; impaired nervous system 
function. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 169 as residues: Glu-34 to Trp-39. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in brain, particularly in the hypothalamus, indicates 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection, treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions. Representative uses are described in the 
"Regeneration" and "Hyperproliferative Disorders'* sections below, in Example 1 1, 
15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
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cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional ' 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:56 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 618 of SEQ ID NO:56, b is an 
integer of 15 to 632, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:56, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

The translation product of this gene shares sequence homology with human 
lecithin-cholesterol acyltransferase (LCAT). which catalyses the transfer of fatty acid 
from the sn-2 position of lecithin to the free hydroxyl group of cholesterol. Preferred 
polypeptides of the invention comprise the following amino acid sequence: RLVYN 
KTSRATQFPDGVDVRVPGFGKTFSLEFLDPSKSSVGSYFHTMVESLVGWGYT 
RGEDVRGAPYDWRRAPNENGPYFLALREMrEEMYQLYGGPVVLVAHSMGN 
MYTLYFLQRQPQAWFCDKYIRAFV'SLGAPWGGVAKTLRVLASGDNNRIPVIG 
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PLKJREQQRSAVSTSWLLPYKYTWSPEKWVQTPTINYTLRDYRKFFQDIGFE 
DGWLMRQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFG 
i0 DGDGTVNLKSALQCQAWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRV 
LLGP (SEQ ID NO: 318). Polynucleotides encoding such polypeptides are also 
5 provided. 

This gene is expressed primarily in osteoblasts & dendritic cells and to a lesser 
15 extent in muscle and other hematopoietic cell lineages. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hematopoietic disorders; immune dysfunction; osteoporosis; 
osteopetrosis; muscle degeneration. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
25 identification of the tissue(s) or cell type(s). For a number of disorders of the above 

tissues or cells, particularly of the skeletal and immune systems, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial Huid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO; 170 as residues: Cys-65 to Ser-71. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution and homology to lecithin-cholesterol acyltransferase 
(LCAT) indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and/or treatment of a variety of disorders. For example, 
artheroscelerosis is a pathological condition of mammals characterised by the 
accumulation of cholesterol in the arteries, which leads to heart disease, strokes, heart 
attacks and peripheral vascular disease. The enzyme could be used in a novel method 
of treating atherosclerosis, which involves increasing the level of LCAT activity, 
50 which then causes a decrease in the accumulation of cholesterol. The method and the 
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products can be used for the prophylaxis and treatment of atherosclerosis, and 
associated heart disease, myocardial infarction, stroke and peripheral vascular disease, 
as well as individuals suffering from Fish Eye Syndrome (caused by LCAT 
deficiency) or Classic LCAT Deficiency Syndrome. Alternately, elevated expression 
of this gene product in osteoblasts and hematopoietic cell lineages indicates that it 
may piay additional roles in bone turnover, regulation of immune system function, 
and muscular function. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 57 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically ■ 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2673 of SEQ ID NO:57, b is an 
integer of 15 to 2687, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:57, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

When tested against HELA epithelial cell lines, supernatants removed from 
cells containing this gene activated the GAS assay. Thus, it is likely that this gene 
activates epithelial cells through the Jak-STAT signal transduction pathway. The 
gamma activating sequence (GAS) is a promoter element found upstream of many 
genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a 
large, signal transduction pathway involved in the differentiation and proliferation of 
cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of the 
GAS element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

This gene is expressed primarily in adult brain, infant brain, fibroblasts, 
embryonic and fetal tissue (e.g., spleen, liver), placenta and to a lesser extent in 
endocrine organs, cancerous colon and breast. 



WO 00/06698 



125 



PCT/US99/I7130 



Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and tor diagnosis of diseases and conditions which include, but are 
not limited to, dementia, epilepsy, schizophrenia, and developmental abnormalities. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunologicai probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
neural system, endocrine system, and during development, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 1 1, 15, and 18, and elsewhere herein. Briefly, 
the uses include, but arc not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. In addition, the expression of this gene product in synovium (synovial 
sarcoma) would suggest a role in the detection and treatment of disorders and 
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conditions afflicting the skeletal system, in particular osteoporosis, bone cancer, 
connective tissue disorders (e.g. arthritis, trauma, tendonitis, chrondomalacia and 
inflammation). The protein is also useful in the diagnosis or treatment of various 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, and 
dcrmatomyositis). dwarfism, spinal deformation, joint abnormalities, and 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid, 
etc.). The tissue distribution in endocrine tissues indicates polynucleotides and 
polypeptides corresponding^) this gene are useful for the detection, treatment, and/or 
prevention of various endocrine disorders and cancers. Representative uses are 
described in the "Biological Activity", "Hyperproliferative Disorders", and "Binding ' 
Activity" sections below, in Example 11, 17, 18, 19, 20 and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
Addison's Disease, Cushings Syndrome, and disorders and/or cancers of the 
pancrease (e.g. diabetes meilitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper- 
.hypoparathyroidism) , hypothallamus, and testes. Additionally, the expression within 
fetal tissue, cancerous colon and breast, and other cellular sources marked by 
proliferating cells indicates this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis, treatment, and/or prevention of 
developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration'' sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
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polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, ' 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:58 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 605 of SEQ ID NO:58, b is an 
integer of 15 to 619, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:58, and where b is greater than or equal to a+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

Preferred polypeptides of the invention comprise the following amino acid 
sequence or a subfragment thereof: MNKEDKVWNDCKGVNKLTNLEEQYIILIFQ 
NGLDPPANMVFESIINEIGIKNNISNFFAKIPFEEANGRLVACTRTYEESIKGSC 
GQKENKIKTVSreSKIQLRSKQEFQFFDEEEETGENHTIFlGPVEKLIVYPPPPA 
KGGISVTNEDLHCLNEGEFLNDVI1DFYLKYLVLEKLKKEDADRIHIFSSFFYK 
RLNQRERRNHETTNLSIQQK^HGRVKTWTRHVDIFEKDFIFVPLNEAAHWFL 
AVVCFPGLEKPKYEPNPHYHENAVIQKCSTVEDSaSSSASEMESCSQNSSAK 
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PVIKXMLNKKHCIAVIDSNPGQEESDPRYKRNICSVKYSVKKINHTASENEEF 
NKGESTSQKS (SEQ ID NO: 319). Polynucleotides encoding such polypeptides are 
also provided. 

This gene is expressed primarily in fetal tissue, stomach, brain, endometrial 
cells, and bone and to a lesser extent in prostate, retina, adipocytes, smooth muscle, 
and tumors of the endometrium, ovaries, and parathyroid. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample anchor diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the endocrine system, ulcers, stomach cancer, epilepsy, 
schizophrenia, dementia, bone growth, developmental disorders and resorption. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive system and neural systems expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., neural, 
endocrine system, cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 1 1, 15, and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
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psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Expression of this gene product in stomach tissue indicates involvement in 
digestion, processing, and elimination of food, as well as a potential rote for this gene 
as a diagnostic marker or causative agent in the development of stomach cancer, and 
cancer in general. The expression within embryonic, fetal tissue and other cellular 
sources marked by proliferating cells indicates this protein may play a role in the 
regulation of cellular division, and may show utility in the diagnosis, treatment, 
and/or prevention of developmental diseases and disorders, including cancer, and 
other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. The tissue distribution 
in parathyroid tumor indicates polynucleotides and polypeptides corresponding to this 
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gene are useful for the detection, treatment, and/or prevention of various endocrine 
disorders and cancers. Representative uses are described in the "Biological Activity", 
"Hyperproliferative Disorders", and "Binding Activity" sections below, in Example 
11, 17, 18, 19, 20 and 27, and elsewhere herein. Briefly, the protein can be used for 
the detection, treatment, and/or prevention of Addison's Disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g.. hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-,hypoparathyroidism) , hypothallamus, and 
testes. The tissue distribution in testes indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that is expressed, particularly at low levels, in other tissues 
of the body. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:59 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1364 of SEQ ID NO: 59, b is an 
integer of 15 to 1378, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:59, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

The translation product of this gene shares good protein homology with 
Xenopus NaDC-2 gene and a rabbit renal sodiurn/dicarboxylate cotransporter. The 
translation product of this gene also shares good homology with a rat placental protein 
which is a sodium-coupicd high affinity dicarboxylate transporter. Therefore, it is 
likely that that the translated product encoded by this gene shares similar biological 
activity. 

This gene is expressed primarily in the placenta and colon adenocarcinoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a ' 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental abnormalities as well as failure to thrive anomalies. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
female reproductive system and colon, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues or cell types (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., amniotic, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 
a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 173 as residues: Lys- 166 to Gly- 18 1. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in human placenta and the shared homology of this 
translation product to a rat placental protein indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment 
of disorders of the placenta. Specific expression within the placenta indicates that this 
gene product may play a role in the proper establishment and maintenance of 
placental function. Alternately, this gene product is produced by the placenta and then 
transported to the embryo, where it may play a crucial role in the development and/or 
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survival of the developing embryo or fetus. Expression of this gene product in a 
vascular-rich tissue such as the placenta also indicates that this gene product is 
10 produced more generally in endothelial ceils or within the circulation. In such 

instances, it may play more generalized roles in vascular function, such as in 
5 angiogenesis. It may also be produced in the vasculature and have effects on other 
cells within the circulation, such as hematopoietic cells. It may serve to promote the 
15 proliferation, survival, activation, and/or differentiation of hematopoietic cells, as well 

as other cells throughout the body. The tissue distribution in colon tissue indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for the 
2Q 10 dia gn°sis and/or treatment of disorders involving the colon. Expression of this gene 

product in colon tissue indicates involvement in digestion, processing, and 
elimination of food, as well as a potential role for this gene as a diagnostic marker or 
causative agent in the development of colon cancer, and cancer in general. 
25 Man y polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO. 60 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
35 formula of a-b, where a is any integer between 1 to 1 1 12 of SEQ ID NO:60, b is an 

integer of 15 to 1 1 26, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:60, and where b is greater than or equal to a + 14. 

40 25 FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in the spinal cord. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
45 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
30 not limited to, paralysis, neurologic disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
50 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
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the above tissues or cells, particularly of the nervous system, expression of this gene 
at significantly higher or lower levels is routinely delected in certain tissues or cell 
types (e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level ia healthy tissue or bodily fluid from an individual not 
having the disorder. 

The tissue distribution in spinal cord indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 11, 15, and 18, and 
elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease. Tourette Syndrome, meningitis, encephalitis, demyeiinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a rote in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival, Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ TD NO:61 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2064 of SEQ ID NO:6l, b is an 
integer of 15 to 2078, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:61, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO; 52 

This gene is expressed primarily in keratinocytes, brain, fetal tissues, 
pericardium, stomach, and cancerous tissues (e.g., stomach, adrenals, parathyroid, ' 
germ cell, colon, breast). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skin disorders, neurodegenerative and developmental disorders, heart 
disease, and cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cardiovascular and gastrointestinal systems, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., neural, immune, cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example 1 1. 15. and 18, and elsewhere herein. Briefly, 
the uses include, but arc not limited to the detection, treatment, and/or prevention of 
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Alzheimer's Disease, Parkinson s Disease, Huntington's Disease, Tourettc Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in keratinocytes indicates polynucleotides and ' 
polypeptides corresponding to this gene are useful for the treatment, diagnosis, and/or 
prevention of various skin disorders. Representative uses are described in the 
"Biological Activity", "Hyperproliferative Disorders", "infectious disease", and 
15 "Regeneration" sections below, in Example i 1, 19. and 20, and elsewhere herein. 
Briefly, the protein is useful in detecting, treating, and/or preventing congenital 
disorders (i.e. ncvi, moles, freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (i.e. keratoses, Bowen s Disease, basal cell 
carcinoma squamous cell carcinoma, malignant melanoma, Paget's Disease, mycosis 
fiingoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e. wounds, 
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, 
photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo, 
dermatomyositis. morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may 
predispose increased susceptibility to viial and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molJuscum contagiosum, herpes zoster, boils, cellulitis, 
erysipelas, impetigo, tinea, althletes foot, and ringworm). Moreover, the protein 
product of this gene may also be useful for the treatment or diagnosis of various 
connective tissue disorders (i.e., arthritis, trauma, tendonitis, chrondomaiacia and 
inflammation, etc.), autoimmune disorders (Le., rheumatoid arthritis, lupus, 
scleroderma, dermatomyositis, etc.), dwarfism, spinal deformation, joint 
abnormalities, amd chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, 
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familial osteoarthritis, Atelosteogenesis type n, metaphyseal chondrodysplasia type 
Schmid). The expression within fetal tissue (e.g., spleen and liver) and other cellular 
sources marked by proliferating cells indicates this protein may play a role in the 
regulation of cellular division, and may show utility in the diagnosis, treatment, 
and/or prevention of developmental diseases and disorders, including cancer, and 
other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 

and/or apoptosis in pattern formation. 

f 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers, tt may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response lo aberrant polypeptides, as may exist in 
proliferating and cancerous ceils and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Additionally, the 1 
tissue distribution in the pericardium of the heart indicates that the protein is useful in 
the detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include, but are not limited to miscrovascular disease, vascular leak 
syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery disease, 
arteriosclerosis, and/or atherosclerosis. Furthermore, the protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
Hgands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
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protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ TD NO:62 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 748 of SEQ ID NO:62, b is an 
integer of 15 to 762, where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO:62, and where b is greater than or equal to a + 14. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

15 This gene is expressed primarily in the brain and in cartilage and to a lesser 

extent in the retina, activated T-cells, pineal gland, the lungs, and in synovial 
sarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
35. not 1United to ' nosological diseases, such as epilepsy and dementia, osteoarthritis, 

retinopathies, hematopoietic diseases, emphysema, and lung cancer . Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
25 a number of disorders of the above tissues or cells, particularly of the neurologic 
system, cartilage and musculature, vision, the hematopoietic system, and the 
pulmonary system expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., neural, immune, cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal 
30 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
50 tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 176 as residues: Arg-34 to Cys-44. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in brain indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions. 
Representative uses are described in the "Regeneration" and "Hyperproliferative 
Disorders" sections below, in Example II , 15, and 18, and elsewhere herein. Briefly, 
the uses include, but are not limited to the detection, treatment, and/or prevention of 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS. 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. The tissue distribution in T-cells indicates polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Representative uses are described in the "Immune 
Activity" and "infectious disease" sections below, in Example 11,13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses ). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
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as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
10 sucn as T-ceN mediated cytotoxicity; immune reactions to transplanted organs and 

tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
5 disorders, such as autoimmune infertility, lense tissue injury, dcmyelination, systemic 
lupus crythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
15 Disease, and scleroderma Moreover, the protein may represent a secreted factor that 

influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
20 10 thc expansion of stem cells and committed progenitors of various blood lineages, and 

in the differentiation and/or proliferation of various cell types. The expression of this 
gene product in synovium would suggest a role in the detection and treatment of 
disorders and conditions afflicting the skeletal system, in particular osteoporosis, bone 
25 cancer, connective tissue disorders (e.g. arthritis, trauma, tendonitis, chrondomalacia 

15 and inflammation). The protein is also useful in the diagnosis or treatment of various 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis), dwarfism, spinal deformation, joint abnormalities, and 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid, 
20 etc.). Additionally, the expression within fetal tissue and other cellular sources 
35 marked by proliferating cells indicates this protein may play a role in the regulation of 

cellular division, and may show utility in thc diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders' 1 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
30 of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
50 potential roles in proliferation and differentiation, this gene product may have 
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applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:63 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-h, where a is any integer between 1 to 1080 of SEQ ID NO:63, b is an 
integer of 15 to 1094, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:63, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

This gene is expressed primarily in umbilical vein endothelial cells induced by 

LL-4. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
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not limited to, angiogenesis, inflammatory disorders, hematopoietic disease. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
angiogenic and hematopoietic systems, expression of this gene at significantly higher 
or lower levels is routinely detected in certain tissues or cell types (e.g., cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from' an individual not having the disorder. 

The tissue distribution in endothelial cells indicates polynucleotides and 
polypeptides corresponding to this gene are useful in the detection, treatment, and/or 
prevention of vascular conditions, which include, but are not limited to, microvascular 
disease, vascular leak syndrome, aneurysm, stroke, atherosclerosis, arteriosclerosis, or 
embolism. For example, this gene product may represent a soluble factor produced by 
smooth muscle that regulates the innervation of organs or regulates the survival of 
neighboring neurons. Likewise, it is involved in controlling the digestive process, and 
such actions as peristalsis. Similarly, it is involved in controlling the vasculature in 
areas where smooth muscle surrounds the endothelium of blood vessels. Furthermore, 
the protein may also be used to determine biological activity, to raise antibodies, as 
tissue markers, to isolate cognate ligands or receptors, to identify agents that modulate 
their interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. The secreted protein can also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, and as 
nutritional supplements. It may also have a very wide range of biological activities. 
Representative uses are described in the "Chemotaxis" and "Binding Activity" 
sections below, in Examples II, 12, 13, 14, 15, 16, 18, 19, and 20. and elsewhere 
herein. Briefly, the protein may possess the following activities: cytokine, cell 
proliferation/differentiation modulating activity or induction of other cytokines; 
immunosu'mulating/immunosuppressanl activities (e.g. for treating human 
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immunodeficiency vims infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 
hemostatic or thrombolytic activity (e.g. For treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's Disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 

regulation of metabolism, and behavior. Also contemplated is the use of the 

■J" 

corresponding nucleic acid in gene therapy procedures. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:64 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1347 of SEQ ID NO:64, b is an 
integer of 15 to 136 1 . where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:64, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 55 

This gene is expressed primarily in both normal and cancerous pancreas. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diabetes, gastrointestinal disorders, and pancreatic cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the digestive and 
blood systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., cancerous and wounded 
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tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

The tissue distribution in pancreas indicates that the protein products of this 
gene are useful as a therapeutic and/or diagnostic agent for pancreatic disorders and 
disorders of the endocrine and exocrine system, including but not limited to diabetes, 
blood disorders, pancreatic cancer, gastrointestinal diseases, hormomul imbalance, 
autoimmune disorders, cystic fibrosis, pancreatitis, and gallstones. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate Iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:65 and may have been publicly available prior 10 conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 933 of SEQ ID NO:65, b is an 
integer of 15 to 947, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:65, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

The translation product of this gene shares sequence homology with 
oxidoreductasc. Preferred polypeptides of the invention comprise the following amino 
acid sequence: MSPLS AARAALRVYAVGAAVILAQLLRRCRGGFLEPVXPPRP 
DRVAIVTGGTDGIGYSTANlWRDLGMHVnAGNNDSKAKQVVSKIKEETLND 
KVEFLYCDLASMTSIRQFVQFGTCMKKIPLHVLINNAGVMMVPQRKTRDGFEE 
HFGLNYLGHFLLTNLLLDTLKESGSPGHSARVVTVSSATHYVAELNMDDLQS 
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SACYSPHAAYAQSKLALVLFTYHLQRLLAAEGSHVTANVVDPGVVNTDXYK 
HVFWATRLAKK^LGWIXFKTPDEGAWSIYAAVTPELEGVGGRYLYNEKET 
KSLHVTYNQKLQQQLWSKSCEMTG VLDVTL (SEQ ID NO: 320). The mature 
form of this protein begins at residue 32. Thus, polypeptides comprising residues 2- 
330 and 32-330 of the sequence shown above are also provided. Polynucleotides 
encoding such polypeptides are also provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MSPLSAARAALRVYAVGAAVILAQLLRRCRGGFLEP 
VXPPRPDRVATVTGGTEJpiG YSTANIWRDLACMLS (SEQ ID NO: 321). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in breast cancer cells, osteoclastoma, wilmV 
tumor, thymus stromal cells, and T cell helper I. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer, e.g., breast cancer, osteoclastoma, and wilm's tumor. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell typc(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, kidney, immune, hematopoietic, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, breast milk, 
urine, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in breast cancer tissue, combined with the homology to 
oxidoreductase indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for diagnosis and treatment of cancer, particularly, breast cancer, 
osteoclastoma, and wilm's tumor. This protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
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conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morpbogen to control cell and tissue type specification. Therefore, the - 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:66 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1362 of SEQ ID NO:66, b is an 
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integer of 15 to 1376, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 66, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in monocytes, T cell helper II and B cell 
lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the rissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly B- 
cell lymphoma. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 180 as residues: Asp-30 to Val-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in monocytes, T cell helper, and B cell lymphoma cells 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for diagnosis and treatment of B cell lymphoma. Representative uses are described in 
the "Immune Activity" and "infectious disease" sections below, in Example 11, 13, 
14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
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other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:67 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 2420 of SEQ ID NO:67, b is an 
integer of 15 to 2434, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:67, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 58 
This gene is expressed primarily in human lung cancer. 
10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 bio:ogica; sample and for diagnosis of diseases and conditions which include, but are 
not limited to, pulmonary diseases and/or disorders, particularly cancers of the lung. 
15 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
2Q 10 immune system, expression of this gene at significantly higher or lower levels is 

routinely detected in certain tissues or cell types (e.g., pulmonary, and cancerous and* 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, pulmonary lavage, 
pulmonary surfactant, synovial fluid and spinal fluid) or another tissue or cell sample 
25 taken from an individual having such a disorder, relative to the standard gene 

15 expression level, i.e„ the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 181 as residues: Phe-39 to Asp-45. Polynucleotides 
encoding said polypeptides are also provided. 
20 The tissue distribution in lung cancer tissue indicates that polynucleotides and 

35 polypeptides corresponding to this gene arc useful for diagnosis and treatment of 

immune system disorders such as ARDS, cystic fibrosis, and cancer, particularly lung 
cancer. This protein may play a role in the regulation of cellular division, and may 
show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperprotiferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
30 death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
50 neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
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potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, ant/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:68 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1072 of SEQ ID NO:68, b is an 
integer of 15 to 1086, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:68, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

This gene is expressed primarily in larynx carcinoma and early stage human 

lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell typc(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental, gastrointestinal, and pulmonary diseases and/or 
disorders, particularly larynx carcinoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or ceils, particularly of the immune system, expression of this gene 
at significandy higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., developmental, gastrointestinal, pulmonary, and cancerous and wounded 

tissues) or bodily fluids (e.g.* serum, plasma, amniotic fluid, pulmonary lavage, 

f 

sputum, urine, synovial fluid and spinal fluid) or another tissue or ceil sample taken 
from an individual having such a disorder, relative to the standard gene expression* 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 182 as residues: His-42 to Lys-49. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in larynx carcinoma and early stage human lung 
indicates that polynucleotides and polypeptides corresponding to this gene are useful 
for treating immune system disorders such as cancer, particularly larynx carcinoma. 
This protein may play a role in the regulation of cellular division, and may show 
utility in the diagnosis, treatment, and/or prevention of developmental diseases and 
disorders, including cancer, and other proliferative conditions. Representative uses are 
described in the "Hyperproliferative Disorders" and "Regeneration" sections below 
and elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of ceil 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and lissue type specification. Therefore, the 
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polynucleotides and polypeptides of the present invention are use fill in treating, 

detecting, and/or preventing said disorders and conditions, in addition to other types 

of degenerative conditions. Thus this protein may modulate apoptosis or tissue 

differentiation and is useful in the detection, treatment, and/or prevention of 

degenerative or proliferative conditions and diseases. The protein is useful in 

modulating the immune response to aberrant polypeptides, as may exist in 

proliferating and cancerous ceils and tissues. The protein can also be used to gain new 

insight into the regulation of cellular growth and proliferation. Furthermore, the 

protein may also be used to determine biological activity, to raise antibodies as tissue 

4' ' 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 

interactions, in addition to its use as a nutritional supplement. Protein, as well as, * 

antibodies directed against the protein may show utility as a tumor marker and/or 

immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:69 and may have been publicly available prior to conception of 

the present invention. Preferably, such related polynucleotides are specifically 

excluded from the scope of the present invention. To list every related sequence is 

cumbersome. Accordingly, preferably excluded from the present invention are one or 

more polynucleotides comprising a nucleotide sequence described by the general 

formula of a-b. where a is any integer between I to 1248 of SEQ ID NO:69, b is an 

integer of 15 to 1 262, where both a and b correspond to the positions of nucleotide 

residues shown in SEQ ID NO:69, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: MEVTTEDTSRTDVSEPATSGGAADGVTSIAPTAVASSTTAASrTTA 
ASSMTVASSAPTTAASSTTVASL\ITrrASSMTAASSTP^ 
TPSTTATGHPSI^TALAQ VPKSS ALPRTATLATL ATRAQTVATTANTSS PMST 
RPSPSKHMPSDTAASPVPPMXPQAQGPISQVSVDQPVVVTTXKSTXMPSNTT 
XEPLTQAWDKTLLLVVLLLGVTLFITVLVLFALQAYESYKKKDYTQVDYLI 
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NGM YADSEM (SEQ ID NO: 322). Polynucleotides encoding these polypeptides arc 
also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: ARCPELPGLRCRPRPRAGPQ APS YCPR ATRPPG 
ACCARMRLLLEWRVYLRLTCATKDGMARECPTTWLSPPAKPDFAQRHSVK 
PTALQGGRWSRLGASP (SEQ ID NO: 323). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in adipocytes, osteoblasts, cerebellum, 
hypothalamus and Hodgkin's lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic, skeletal, neural, and immune diseases and/or disorders, 
particularly Hodgkin's lymphoma. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., metabolic, skeletal, neural immune, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 183 as residues: Pro-33 to Gln-40, Gly-51 to Arg-56. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in Hodgkin's lymphoma cells indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of immune system disorders such as cancer, particularly Hodgkin's 
lymphoma. The secreted protein can also be used to determine biological activity, to 
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raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify 
agents that modulate their interactions, and as nutritional supplements. It may also 
have a very wide range of biological activities. Representative uses are described in 
the "Chemotaxis" and "Binding Activity" sections below, in Examples 11,12, 13, 14, 
15, 16, 18, 19, and 20, and elsewhere herein. Briefly, the protein may possess the 
following activities: cytokine, cell proliferation/differentiation modulating activity or 
induction of other cytokines; immunostimulating/immunosuppressant activities (e.g. 
for treating human immunodeficiency virus infection, cancer, autoimmune diseases 
and allergy); regulation of hematopoiesis (e.g. for treating anemia or as adjunct to 
chemotherapy); stimulation or growth of bone, cartilage, tendons, ligaments and/or 
nerves (e.g. for treating wounds, stimulation of follicle stimulating hormone (for 
control of fertility); chemotactic and chemokinetic activities (e.g. for treating 
infections, tumors); hemostatic or thrombolytic activity (e.g. for treating hemophilia, 
cardiac infarction etc.); anti-inflammatory activity (e.g. for treating septic shock, 
Crohn's Disease); as antimicrobials; for treating psoriasis or other hyperproliferative 
diseases; for regulation of metabolism, and behavior. Also contemplated is the use of 
the corresponding nucleic acid in gene therapy procedures. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 70 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is .any integer between 1 to 1628 of SEQ ID NO: 70, b is an 
integer of 15 to 1642, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:70, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

The translation product of this gene shares sequence homology with 
polypeptide in the cystatin family. Cystatin polypeptides are cysteine protease 
inhibitors. For an analysis of the composition of several members of the cystatin 
family see Gene (1987) 6 l(3):329-338, incorporated herein by reference. The cystatin 
activity of polypeptides encoded by this gene is measured by several assays known in 
the art including assays described in coowned, copending US Patent Application 
Serial No. 08/744, 138, incorporated herein by reference. Preferred polypeptides of the 
invention comprise the following amino acid sequence: LPATVEFA VHTFNQQS KD 
YYAYRLGHILNSWkEQVESKTWSMEIXLGRTRCGKFEDDroNCHf^ESTEL 
NNTFTCFFTISTRPWNfTQFSLLNKTC (SEQ ID NO: 324). Fragments of such' 
polypeptides having cystatin activity (cysteine protease inhibitory activity are 
particularly preferred). Polynucleotides encoding such polypeptides are also provided. 

In another embodiment polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LLWARGLGRAKS AVPTVST MLGLPWKGGLS 
WALLLLLLGSQILLIYAWHFHEQRDCDEHN^ 

KDYYAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQE 
STELNNTFTCFFTISTRPWMTQFSLLNK TCLEGFH (SEQ ID NO: 325). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in testes and epididiymus. For a review of a 
cystatin showing testes- specific expression see Mol. Endocrinol. (1992 Oct.) 
6(10): 1653- 1664, incorporated herein by reference. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, They should therefore serve a protective function to regulate the 
activities of such endogenous proteinases, which otherwise may cause uncontrolled 
proteolysis and tissue damage. Cysteine proteinase activity can normally not be 
measured in body fluids, but can been detected extracellulariy in conditions like 
endotoxin-induced sepsis, metastasizing cancer, and at local inflammatory processes 
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in rheumatoid arthritis , purulent bronchiectasis and periodontitis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, testicular, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, seminal fluid, synovial fluid and 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 184 as residues; Phe-31 to Asp-38. Asn-59 to Tyr-65, 
Ser-76 to Glu-82, Thr-96 to Cys-108, GIn-1 1 1 to Asn-118. Polynucleotides encoding 
said polypeptides are also provided. 

The tissue distribution in testes and epididiymus. combined with the 
homology to cystatins indicates that polynucleotides and polypeptides corresponding 
to this gene are useful for the treatment and diagnosis of conditions concerning proper 
testicular function (e.g. endocrine function, sperm maturation), as well as cancer. 
Therefore, this gene product is useful in the treatment of male infertility and/or 
impotence. This gene product is also useful in assays designed to identify binding 
agents, as such agents (antagonists) are useful as male contraceptive agents. Similarly, 
the protein is believed to be useful in the treatment and/or diagnosis of testicular 
cancer. The testes are also a site of active gene expression of transcripts that is 
expressed, particularly at low levels, in other tissues of the body. Therefore, this gene 
product is expressed in other specific tissues or organs where it may play related 
functional roles in other processes, such as hematopoiesis, inflammation, bone 
formation, and kidney function, to name a few possible target indications. 
Representative uses are described in the "Hyperproliferative Disorders" and 
"Regeneration" sections below and elsewhere herein. Briefly, developmental tissues 
rely on decisions involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
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of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention arc useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 

v 

degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used lo gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Cysteine proteinase inhibitors of the cystatin superfamily are ubiquitous in the 
body and are generally tight-binding inhibitors of papain-like cysteine proteinases, 
such as cathepsins B, H, L, S, and K. They should therefore serve a protective 
function to regulate the activities of such endogenous proteinases, which otherwise 
may cause uncontrolled proteolysis and tissue damage. Cysteine proteinase activity 
can normally not be measured in body fluids, but can been detected extraceliularly in 
conditions like endotoxin-induced sepsis, metastasizing cancer, and at local 
inflammatory processes in rheumatoid arthritis, purulent bronchiectasis and 
periodontitis, which indicates that a tight cystatin regulation is a necessity in the 
normal state. A deficiency state in which the levels of the intracellular cystatin, 
cystatin B, are lowered due to mutations has recently been shown to segregate with a 
form of progressive myoclonus epilepsy, which points to additional specialized 
functions of cystatins. Moreover, results showing that chicken cystatin inhibits polio 
virus replication, human cystatin C inhibits corona- and herpes simplex virus 
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replication, and human cy statin A inhibits rhabdovirus-induced apoptosis in cell 
cultures indicates that cystatins play additional roles in the human defense system. 
The cystatins constitute a superfamily of evolutionary related proteins, all composed 
of at least one 100-120 residue domain with conserved sequence motifs. 

The previously well characterized single-domain human members of this 
superfamily could be grouped in two protein families. The Family I members, 
cystatins (or stetins) A and B, contain approximately 100 amino acid residues, lack 
disulfide bridges, and are not synthesized as preproteins with signal peptides. The 
Family 2 cystatins (cystatins C, D, S, SN, and SA) are secreted proteins of approx. 
120 amino acid residues (Mr 13,000-14,000) and have two characteristic intrachain 
disulfide bonds. Recendy, we identified an additional human cystatin superfamily ' 
member hy EST1 sequencing in epithelial cell derived cDNA libraries which we 
named cystatin E. The same cystatin was independently discovered by differential 
display experiments as a mRNA species down-regulated in breast tumor tissue, but 
present in the surrounding epithelium and reported under the name cystatin M . 
Cystatin E/M is an atypical, secreted low-Mr cystatin in that it is a glycoprotein and 
just shows 30-35% sequence identity in alignments with the human Family 2 
cystatins, which shows that additional cystatin families are yet to be identified. The 
cystatin E/M gene has been localized to chromosome 2 , whereas all human Family 2 
cystatin genes are clustered on the short arm of chromosome 20, which further 
stresses that cystatin E/M is just distantly related to the other secreted human low-Mr 
cystatins. It is believed therefore, that polypeptides encoded by this gene are useful in 
diagnosing and treating disease consistent with the aforementioned conditions in 
which cystatins are implicated. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:7l and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 907 of SEQ ID NO:71, b is an 
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integer of 15 to 92 1 , where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 7 1, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

The translation product of this gene shares sequence homology with 
Neutrophil Gelatinase-Associated Lipocaiin which is thought to be important in 
immune regulation (See Genbank andGeneseq Accession Nos. emblCAA58l27.1, 
and US5627034, respectively; all references and information available through these 
accessions are hereby incorporated herein by reference; for example, Biochem. 
Biophys. Res. Commun. 202 (3), 1468-1475 (1994), and FEBS Lett. 314 (3). 386-388 
(1992)). 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LEQKLELHRGGGRSRTSGSPGLQEFGTREERGE 
GEQRTGREFSGNGGRAVEAARMRLLCGLWLWLSLLKVLQAQTPTPLPLPP 
PMQSFQGNQFQGEVVTVLGLAGNSFRPEHRALLNAFTATraLSDDGRFEVWN 
AMTRGQHCDTWSYVLIPAAQPGQFTVDHGVGRSWLLPPGT1XK3FICLGRAQ 
GLSDDN1VFPDVTGXALDL XSLPWVAAPA (SEQ ID NO: 326). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in epididiymus and osteoclastoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive and skeletal diseases and/or disorders, particularly cancers 
such as osteoclastoma testicular cancer. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., reproductive, testicular, skeletal, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, seminal fluid, urine, synovial fluid and spinal fluid) 
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or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
lissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 185 as residues: Met-82 to Thr-90. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in epididiymus and homology to neutrophil gelatinase- 
associated lipocalin indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for diagnosis and treatment of skin diseases and immune system 
disorders such as cancer\ particularly osteoclastoma. The secreted protein can also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, and as 
nutritional supplements. It may also have a very wide range of biological activities. 
Representative uses are described in the "Chemotaxis" and "Binding Activity" 
sections below, in Examples 11, 12. 13, 14, 15, 16, 18, 19. and 20, and elsewhere 
herein. Briefly, the protein may possess the following activities: cytokine, cell 
prolifcration/difTerentiation modulating activity or induction of other cytokines; 
irnmunostimulating/immunosuppressant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflairunatory activity (e.g. for treating septic shock, Crohn's Disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 72 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 892 of SEQ ID NO: 72, b is an 
integer of 15 to 906, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:72, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

The translation product of this gene was shown to have homology to colipase 
which plays an essentia! role in the intestinal fat digestion by anchoring lipase on 
lipid/water interfaces in the presence of bile salts (See Genbank Accession No. 
gb!AAA035 13.1; all references and information available through this accession are 
hereby incorporated by reference herein). 

This gene is expressed primarily in epididiymus. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, reproductive diseases and/or disorders, particularly epididiymus -related 
diseases. Similarly, polypeptides and autibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., reproductive, 
metabolic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
seminal fluid, bile, chyme, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 1 86 as residues: Ile-40 to Cys-49, Arg-52 to Cys-57, 
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Ser-94 to Trp-99, Gly-105 to Gly-1 1 1. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in epididiymus indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immune system diseases and disorders of the epididiymas. Polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or^impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
expression of transcripts that is expressed, particularly at low levels, in other tissues 
of the body. Therefore, this gene product is expressed in other specific tissues or 
organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate Ugands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO: 73 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described hy the general 
formula of a-b, where a is any integer between 1 to 666 of SEQ ID NO:73, b is an 
integer of 15 to 680, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:73. and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: MCVCERKRGREKEGGVTPTMTSNFPFCTLILXjI 
AQAQACPGCPGDWPGLGSGVGEGLHHIRTCRTPIPCSPPAPAAACLGSGH 
ARLPCVUU-WPVPANI^SPFRLEALHCSFWSSPLLPAPIILAFFGFRDLLTDFL 
LAACLLTFQKTPLELI^MA WHLLV AIPCYQMLDNLPLPS AAAN WC (SEQ JD 
NO: 327). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in melanocytes and placenta and to a lesser 
extent in bone marrow and many cells of the immune system, including B-cells, 
dendritic cells, and T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, skin cancer and disorders of the reproductive and immune systems. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
reproductive and immune systems, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues and cell types (e.g., reproductive 
tissue, hematopoietic tissue, melanocytes and cells and tissue of the immune system, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, amniotic 
fluid, urine, synovial fluid and spinal iluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

The tissue distribution in melanocytes indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for the diagnosis and treatment of 
disorders affecting the skin, the reproductive system, and the immune system, 
particularly cancers. Representative uses are described in the "Biological Activity", 
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"Hyperproliferative Disorders", "infectious disease" , and "Regeneration" sections 
below, in Example 1 I, 19. and 20, and elsewhere herein. Briefly, the protein is useful 
in detecting, treating, and/or preventing congenital disorders (i.e. nevi, moles, 
freckles, Mongolian spots, hemangiomas, port-wine syndrome), integumentary 
tumors (i.e. keratoses, Bowen's Disease, basal cell carcinoma, squamous cell 
carcinoma, malignant melanoma, Paget's Disease, mycosis fungoides, and Kaposi's 
sarcoma), injuries and inflammation of the skin (i.e.wounds, rashes, prickly heat 
disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, 
autoimmune disorders (Le. lupus erythematosus, vitiligo, dennatomyositis, morphea, 
scleroderma, pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, 
purpura, and xanthelasma. In addition, such disorders may predispose increased 
susceptibility to viral and bacterial infections of the skin (i.e. cold sores, warts, 
chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, erysipelas, 
impetigo, tinea, althletes toot, and ringworm). Moreover, the protein product of this 
gene may also be useful for the treatment or diagnosis of various connective tissue 
disorders (i.e., arthritis, trauma, tendonitis, chrondomalacia and inflammation, etc.), 
autoimmune disorders (i.e., rheumatoid arthritis, lupus, scleroderma, 
dennatomyositis, etc.), dwarfism, spinal deformation, joint abnormalities, amd 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:74 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between I to 1619 of SEQ ID NO:74, b is an 
integer of 15 to 1633, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:74, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: YLWGRPRLRMRAGTSPSAPWGEKREKLGHKLPVALQGYHPWIL 
LECTVFWARVVLACFSLYLIRGPNCINRQPEPTYQKACNLDCSSDFGQER 
AP AWELLGPES EQRLREYTAQGLQSLASS HRWRQFKTEGKMRGGAS PLPWLI 
CFW LCSYKGSDNSL^PVVPGPTLCPQSLVSPSVHPSTRSASLGRHRAEAA 
(SEQ ID NO: 328). Polynucleotides encoding these polypeptides are also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides 0 f the invention comprise the 
following amino acid sequence: MPGILAG1PVKDLCLSLLQGFRLLLLCVCPGWL 
SGWMGGQKGSPRIVDIG (SEQ ID NO: 329). Polynucleotides encoding these 
polypeptides are also provided. This gene maps to chromosome 15, accordingly, 
polynucleotides of the invention is used in linkage analysis as a marker for 
chromosome 15. 

This gene is expressed primarily in brain and breast and to a lesser extent in 
the liver, pancreas, and T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting the brain and CNS, the reproductive system, or the 
immune system. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell typc(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system, the reproductive system, and the immune 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., brain and other tissue of the nervous 
system, mammary tissue, eendocrine tissue, hepatic tissue, reproductive tissue, cells 



WO 00/06698 



165 



PCT/US99/17I30 



and tissue of the immune system, cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 188 as residues: Met-37 to Ser-43. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in brain cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
disorders affecting the central nervous system, the reproductive system, and the • 
immune system, including cancers. Representative uses are described in the 
"Regeneration" and "Hyperproliferative Disorders" sections below, in Example 1 1, 
15, and 18, and elsewhere herein. Briefly, the uses include, but are not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, de myelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as. antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:75 and may have been publicly available prior ip conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1008 of SEQ ID NO:75, b is an 
integer of 15 to 1022, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:75, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

In another embodiment, polypeptides comprising the amino acid sequence o'f 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically; polypeptides of the invention comprise the 
following amino acid sequence: A KGEERKEAFS LJCM VQLS SEPISFGLM YLYLGV 
FraLIYrcALSrTTLGKHSHPI^AEQNSTVWMEHTLFHQSPVASHLVCFQSF 
AFSE (SEQ ID NO: 330). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in the brain and the immune system, in 
particular T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or ceil type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting the brain, such as Alzheimer's or disorders affecting 
the immune system, such as AIDS. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing, immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the brain and CNS and the immune systems, expression 
of this gene at significantly higher or lower levels is routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, cells and 
tissue of the immune system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e. : the expression level in healthy tissue or bodily fluid from an 

individual not having the disorder. 

The tissue distribution in brain cells and tissues indicates that polynucleotides 

and polypeptides corresponding to this gene are useful for the diagnosis and treatment 

of disorders affecting the brain and CNS or disorders affecting the immune system. 

Representative uses are described in the "Regeneration" and "Hyperproliferative 

Disorders" sections below, in Example 11. 15. and 18. and elsewhere herein. Briefly, 

the uses include, but are not limited to the detection, treatment, and/or prevention of 

Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 

. . . r 
meningitis, encephalitis, demyel mating diseases, peripheral neuropathies, neoplasia, 

trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated expression of this gene product 
in regions of the brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers; to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues- 
Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 76 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention, to list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 170 of SEQ ID NO:76, b is an 
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integer of 15 to 1 184, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:76 ( and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 

The translation product of this gene shares sequence homology with 
pcnaetdin-2 which is thought to be a members of a new family of antimicrobial 
peptides from the nemo lymph of shrimps Penaeus vannamei. The molecules display 
antimicrobial activity against fungi and bacteria with a predominant activity against 
Gram-positive bacteria. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GPAHPASPPLMTLSLQLAELVHFVCAFQSQWT 
GVYPMMPPLKPTEPLCFA CVPCRV (SEQ ID NO: 331). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly 
disorders affecting the spleen, including bacterial and fungal infections. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the hematopoeiic 
and immune systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues and cell types (e.g.. immune, hematopoietic, and 
cells and tissue of the immune system, cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 
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The tissue distribution in spleen and homology to the penaeidin family of 
antibiotics indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for the diagnosis and treatment of disorders affecting the spleen, especially 
fungal and bacterial infections. Representative uses are described in the "Immune 
Activity ' and "infectious disease" sections below, in Example 11, 13. 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of ihis gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host- versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, arc publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:77 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 298 of SEQ ID NO:77, b is an 
integer of 15 to 3 12, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ CD NO:77, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-1 cells to 
calcium. Thus it is likely that the product of this gene is involved in a signal 
transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both monocytes, in addition to other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but arc 
not limited to, activating immune and hematopoietic cells and tissue cell types. 
Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium and sodium, as well as alter pH and membrane 
potential. Alterations in small molecule concentration can be measured to identify 
supernatants which bind to receptors of a particular cell- 
Moreover, when tested in TF-I cell lines, the protein product of this gene has 
been shown to alter the steady-state messenger RNA levels of the following genes; c- 
fos, c-jun, egr- 1, b56l , bcl-2, CD40, cyclin D2, GADPH, ICER, MAD3, p21, STAT3, 
ID3, and STAT- 1 . When tested in U937 cell lines, the protein product of this gene has 
been shown to alter the steady-state messenger RNA levels of the following genes: 
egr2, MKPU ATF3, B562, cyclin D, cyclin D2, GAT A3, MAD3, p2I, TGF, DHFR, 
and JAK3. Based upon these results, it is anticipated that polynucleotides and 
polypeptides corresponding to this gene are useful as agonists or antagonists of the 
above referenced genes. Such activity is useful in therapeutic and/or diagnostic 
applicators as referenced and more specifically discussed elsewere herein. 
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Tn specific embodiments, pol peptides of the invention comprise the 
sequence:MLLEVYGDSISVTVAIPL (SEQ ID NO:332), 

MHSPCQSKAADGLGKSETE (SEQ ID NO: 333), and/or MLKSLGLSTN (SEQ ID 
NO; 334). Polynucleotides encoding these polypeptides are also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: AQRLAEECFYMLLEVYGDSISVTVAIPLMHSP 
CQSKAADGLGKSETpMLKSLGLSTNMSPFHLLGL 

FFEVQPLGLLALNr^CTATAGLK£LCMHPPSLAFTPEFHTSl^PLAIPSFCOTS 
VSLSNSHTIPLSLYLPFPSKS^IPDTLHLLVHSLPLVHSQVLPVKDVTIBWPLC 
QRCLGSTCH Q (SEQ ID NO: 335). Polynucleotides encoding these polypeptides 
are also provided. 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position U - 27 of the amino acid sequence referenced in 
Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 28 to 
143 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type la 
membrane proteins. 

This gene is expressed primarily in neutrophils and T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders and/or diseases affecting the immune system. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the the immune 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues and cell types (e.g., immune, hematopoietic, cells and tissue 
of the immune system, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQID NO: 191 as residues: Pro-97 to Asp-104. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in neutrophils and T-cells. combined with the detected 
calcium flux biological activity indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of disorders 
affecting the immune system. Representative uses are described in the "Immune 
Activity" and "infectious cUsease'' sections below, in Example 11, 13, 14, 16, 18, 19, 
20, and 27, and elsewhere herein. Briefly, the expression of this gene product 
indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft -versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus crythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
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interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO; 78 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1356 of SEQ ID NO:78, b is an 
integer of 15 to 1370, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:78, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: WIPRA AGIRHEVQVSLFQM FCFSSIFCSH 
EOTrttJM3TFWLFLr^FLILPPSCPCr^ 

PGTSIYYASRGGPXPNSEX (SEQ ID NO: 336). Polynucleotides encoding these 
polypeptides are also provided. 

This gene is expressed primarily in neutrophils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, diseases and/or disorders affecting the immune system, and neutrophils 
in particular. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels is routinely delected in certain tissues and cell types (e.g., blood cells, 
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and cells and tissue of the .immune system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in neutrophils indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
disorders affecting the immune system and neutrophils in particular. Representative 
uses arc described in the "Immune Activity" and "infectious disease" sections below 
in Example 1 1, 13, 14, .16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
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antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:79 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 354 of SEQ ID NO: 79, b is an 
integer of 1 5 to 368, where both a and b correspond to the positions of nucleotide* 
residues shown in SEQ ID NO:79, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: XNXKSPLTIGNKSWSSTAVAAALELVDPPGCR 
NSARDSPELVHLGKGRPRKLMTYLFCSSISLLLLKVHSSGHQDIRKAKSKVP 
RLLIIQCPQQRE (SEQ ID NO: 337). Polynucleotides encoding these polypeptides 
are also provided. 

This gene is expressed primarily in smooth muscle. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting smooth muscle tissue, particularly vascular 
conditions. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of smooth muscle tissue expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., muscle, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
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fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 193 as residues: Ser-I8 to Va!-31. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution primarily in smooth muscle indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and treatment of disorders affecting smooth muscle tissue. Moreover, the 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascuiar disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 80 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1074 of SEQ ID NO:80, b is an 
integer of 15 to 1088, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO;80, and where h is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
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the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GPEENLSPSTPSQMPTI^'VKIjCLLQVCHGLFP 
LLKHWSQPMPLCVTLAPVS YWL (SEQ ID NO: 338). Polynucleotides encoding 
these polypeptides are also provided. 

This gene is expressed primarily in fetal heart, smooth muscle, and frontal 

cortex. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, muscular, vascular, or neural diseases and/or disorders, particularly 
defects or injury to cardiac muscle. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the cardiovascular system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or ceil types 
(e.g., muscular, vascular, neural, developmental, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, amniotic fluid, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in fetal heart indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosing and treating defects 
to the heart either due to injury or congenital defects. Moreover, the protein is useful 
in the detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include, but are not limited to miscrovascular disease, vascular leak 
syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery disease, 
arteriosclerosis, and/or atherosclerosis. Alternatively, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 11, 15, and 18, and 
elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
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treatment, and/or prevention of Alzheimer's Disease, Parkinson s Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function^Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:81 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1848 of SEQ ID NO:8 1. b is an 
integer of 15 to 1862, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:81, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

The translation product of this gene shares sequence homology with adipose 
complement related protein which is thought to be important in regulating energy 
metabolism, insulin levels and fat stores. Moreover, the protein product of this gene 
has also been shown to have homology to the complement subcomponent C1Q A- 
chain precursor and HP-25 protein (See Genbank and Geneseq Accession Nos. 
emblCAA41664. 1, dbjlBAA02352.i, and W98013; ail references and information 
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available through this accession are hereby incorporated by reference herein). Based 
on the sequence similarity, the translation product of this gene is expected to share at 
least some biological activities with complement proteins. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PRVRKEPEAMQWLRVRESPGEATGHRVTMG 
TAALGPVWAALLLFLLMCEIPMVELTFDRAVASDCQRCCDSEDPLDPAHVSS 
ASSSGRPHALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSK 
GDKGEMGSPGAPCQKRFFAFSVGRKTALHSGEDFQTLLFERVFVNLDGC 
FDMATGQFAAPLRGTYFFSLNVHSWNYKETYVHIMHNQKEAVILYAQPS ' 
ERSIMQSQSVMLDLAYGDRVWVRLFKRQRENAIYSNDFDTYITFSGHLIKA 
EDD (SEQ ID NO: 339). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in placenta and, fetal kidney, and umbilical 
vein and to a lesser extent in fetal heart, fetal liver/spleen, microvascular endothelial 
cells and cancers of the lung and pharynx. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, vascular, renal, and reproductive diseases and/or disorders, particularly 
cancers of the lung and pharynx. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the pulmonary and immune systems, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., vascular, renal, reproductive, immune, hematpoietic, pulmonary, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 195 as residues: Asp-36 to Asp-48, Ser-57 to His-62 t 
Lys-77 to Gly-84, Met-92 to Gly-1 14, GIn-203 to IIe-209, Lys-23 1 to Tyr-239. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in pharynx or lung, combined with the homology to 
adipose complement related proteins indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosing and treating cancers of the 
pharynx or lung by modifying the metabolic balance in such tissues. Moreover, the 
protein is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. The gene product may also be 
involved in lymphopoiesis, therefore, it can be used in immune disorders such as 
infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Furthermore, the protein may also be used to determine biological 
activity, to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, 
to identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 8 2 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1604 of SEQ lDNO:82. b is an 
integer of 15 to 1618, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:82, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

The translation product of this gene shares sequence homology with a 
hypothetical 54.7 kD protein (F37A4.1) from Caenorhabditis eiegans (SwissProt 
locus YPT1_CAEEL, accession P41879). The protein product of ihis gene also has 
homology to the human NG26 which is thought to contain a human major 
histocompatibility complex class in and is involved in T-cell maturation (See 
Genbank Accession No. gblAAD18079.1l(AFl29756): all references and information 
available through this accession are hereby incorporated by reference herein; for 
example, J. Neurochcnx 69 (6), 2516-2528 (1997)). Based on the sequence similarity, 
the translation product of this gene is expected to share at least some biological 
activities with nitric oxide synthase proteins. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: MLYPGSVYLLQKALMPVLLQGQARLVEECNGRRAKLLACDGNE 
IDTMFVDRRGTAEPQGQKLV1CCEGNAGFYEVGCVSTPLEAGYSVLGWNHP 
GFAGSTGVPFPQNEANAMDVVVQFAIHRLGFQPQDIIIYAWSIGGFTATWAA 
MSYPDVSAMILDASFDDLVPLALKVMPDSWRGLVTRTVRQHLNLNNAEQLC 
RYQGPVlXIRRTKDEIITTTVPEDIMSNRGNDLLLBaLQHRYPRVMAEEGLRV 
VRQWLEASSQLEEASIYSRWEVEEDWCLSVLRSYQAEHGPDFPWSVGEDMS 
AIKjRRQLALFLARKHLHNFEATHCTPLPAQNFQMPWHL (SEQ ID NO: 340). 
Polynucleotides encoding such polypeptides are also provided. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: VCPKWCRFLTMLGHCCYFWQVWPASEALAA 
GPTPSTGSSSPSWKQHIGTSLQKTRGSLPTTTLTSGAGQSTSTGKNPAAGR 
SLEGALPAGVWPCFAQSPCTGGQQTP SSTGLRSCLVRSPATWWRTP (SEQ ID 
NO: 341). Polynucleotides encoding these polypeptides are also provided. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: WPRAAGIRJiEmiEXDSERAPASVPETPTAVTAPHSSSWDTYYQ 
PRALEKHADSILALASVFWSISYYSSPFAFFYLYRKGYLSLSKVVPFSHYAG 
TLIXLLAGVACXRGIGRWTNPQYRQFITILEATHRNQSSENKRQLANYNFD 
FRS WPVDFH WEEPS SRKESRGG PS RRGVALLRPEPLHRGTADTLLNRVKKL 
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PCQrrSYLVAHTLGRR^YroSVVLLQKALMPVLLQGQARLVEECNGRRAK 
LLACDGNEIDTMFVDRRGTAEPQGQKLVICCEGNAGFYEVGCVSTPLEAGYS 
VLGWNHPGFAGSTGVPFPQNEANAMDVVVQFAIHRLGFQPQDnrYAWSI 
GGFTATWAAMSYPDVSAMILDASFDDLVPLALKVMPDSWRGLVTRTVRQ 
HLNLNNAEQLCRYQGPVLLIRRTKI>EIITTTVPEDIMSNRGNDLLLKLLQHRY 
PRVMAEEGLR VVRQWLE ASSQLEE ASI YS RWE VEEDWCLS VLRS YQ AEHGP 
DFPWSVGEDMSADGRRQLALFLARKHLHNFEATHCT PLPAQNFQMPWHL 
(SEQ ID NO: 342). Polynucleotides encoding these polypeptides arc also provided. A 
preferred polypeptide vaj iant of the invention comprises the following amino acid 
sequence: HERAXGPSRGHGELLSCVLGPRLYKIYRERDSERAPASVPETPTA 
VTAPHSSSWDTYYQP RALEKHADSILALASVFWSISYYSSPFAFFYLYRKGY 
LSLSKVVPFSHYAGTLLLLLAGV ACSEALAAGPTPSTGSSSPSWKQHIGTSLQ 
JCTRGSLPTTTLTSGAGQSTSTGKNPAAGRSLEGALPAGVWPCFAQSPCTGG 
QQTPSSTGL RSCLVRSPATWWRTP (SEQ ID NO: 343). Polynucleotides 
encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
6. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 6. 

This gene is expressed primarily in cerebellum, pituitary, fetal liver, and 
primary dendritic cells and to a lesser extent in in a wide range of tissues and 
developmental stages (i.e. fetal and adult tissue, etc.). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural, developmental, and immune diseases and/or disorders, 
particularly those involving self recognition and T- anc B-cell maturation, and cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
neural or hormonal system, expression of this gene at significantly higher or lower 
levels is routinely detected in certain tissues or cell types (e.g.. neural, developmental, 
immune, hepatic, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
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plasma, amniotic fluid, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ED NO: 196 as residues: Thr-23 to Lys-34, Leu-41 to Ser-47. 
Ala-57 to Ala-68, Pro-89 to Gly- 1 0 1 , Pro- 1 1 0 to Pro- 1 ! 7. Polynucleotides encoding 
said polypeptides are also provided. 

The tissue distribution in developmental and immune cells, combined with the 
homology to the human major histocompatibility complex class III region, indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
treatment and diagnosis of cancer and other proliferative disorders. Representative 
uses are described in the "Immune Activity" and "infectious disease" sections below, 
in Example 1 1, 13, 14, 16, 18, 19, 20. and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; anaVor activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-ceJi mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematous, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
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in the differentiation ami/or proliferation of various cell types. Potentially, this gene 
product is involved in synapse formation, neurotransmission, learning, cognition, 
homeostasis, or neuronal differentiation or survival. Furthermore, the protein may 
also be used to determine biological activity, raise antibodies, as tissue markers, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 83 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2020 of SEQ ID NO:83, b is an 
integer of 15 to 2034, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:83, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

The translation product of this gene shares sequence homology with the br- 1 
protein from the snail nervous system (EMBL HPBRIGENE) which codes for nitric 
oxide synthetase and which is thought to be important in mediating a variety of 
cellular responses, including vasodilation. Preferred polypeptides of the invention 
comprise the following amino acid sequence: MFKRHQRLKKDSTQAEEDLSEQ 
EQNQLNVLKKHGYVVGRVGRTFLYSEEQKDNTPFEFDADSLAFDMENDPVM 
GTHKSTKQVELTAQDVKDAHWFYDTPGITKENC1LNLLTEKEVNIVLPTQSIV 
PRTFVLKPGM VIJTLG AIGRIDFLQGNQS A WFT V V AS N I LP V H1TSLDRADAL Y 
QKHAGHTLLQIPMGGKERMAGFPPLVAEDIMLKEGLGASEAVADIfCFSSAG 
WVSVTPNFKDRLHLRGYTPEGTVLTVRPPLLPYIVNIKGQR1KKSVAYKTKKP 
PSLMYNVRKKKGKINV (SEQ ID NO: 344). Polynucleotides encoding such 
polypeptides are also provided. 
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A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MLP A R LPFRLLS LFLRGS APTAARHG LREPLLERRCA A 
ASSFQHSSSLGRELPYDPVDTEGFGEGGDMQERFLFPEY1LDPEPQFTREKQL 
QELQQQQEEEERQRQQRREERRQQNLRARSREHPWGHPDPALPPSGVNCS 
GCG AXLHCQD AG VPG YLPRBKJT^RTAEACKjGL ARTVCQRCWLLS HHRRALR 
LQVSREQYLELVSAALRXPGPSLVLYMVDLLDLPDALLPDLPALVGPKQUV 
UJNKVDLLPQDAPGYRQRLRERLWEDCARAGLLLAPGTKGHSAPSRTSHR 
TGRIRIRRTGPAQWSGTCG (SEQ ID NO: 345). Polynucleotides encoding these 
polypeptides are also provided. 

When tested against U937 cell lines, supernatams removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter < 
element. Thus, it is likely that this gene activates myeloid cells through the JAK- 
STAT signal transduction pathway. GAS is a promoter element found upstream of 
many genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is 
a large, signal transduction pathway involved in the differentiation and proliferation 
of cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of 
the GAS element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

This gene is expressed primarily in early stage human brain, smooth muscle, 
and endometrial tumor and to a lesser extent in a variety of tissues representing many 
organs and developmental states. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cardiovascular, vascular, and neural diseases and/or disorders, 
particularly congestive heart disease and neurological disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or ceils, particularly of the circulatory and 
neural systems, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., cardiovascular, vascular, 
neural, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
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urine, amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 197 as residues: Phe-42 to Leu-48, Pro-53 to Asp-58, 
Pro-8! to GJu-123, Asp-256 toTrp-269, GIy-282 to Ser-306, Arg-333 to Gly-339, 
Arg-403 to Gln-425, Ser-446 to Asn-<152, His-475 to Gln-480 f Gly-592 to Met-597, 
Pro-635 to His-642, Lys-667 to Lys-672, Lys-678 to Ser-684. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in smooth muscle and vascular tissues, combined with 
the homology to nitric oxide synthetase indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
congestive heart failure and neurological degenerative disorders, polynucleotides and 
polypeptides corresponding to this gene are useful for the detection, treatment, and/or 
prevention of neurodegenerative disease states, behavioral disorders, or inflammatory 
conditions. Representative uses are described in the "Regeneration" and 
"Hyperproliferative Disorders" sections below, in Example 1 1, 15, and 18, and 
elsewhere herein. Briefly, the uses include, but are not limited to the detection, 
treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease. 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyeltnating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Moreover, the protein is useful in the detection, treatment, and/or prevention 
of a variety of vascular disorders and conditions, which include, but are not limited to 
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microvascular disease, vascular leak syndrome, aneurysm, stroke, embolism, 
thrombosis, coronary artery disease, arteriosclerosis, and/or atherosclerosis. 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:84 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b. where a is any integer between I to 2226 of SEQ ID NO: 84, b is an 
integer of 15 to 2240, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:84, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares sequence homology with the 
human KE04p, in addition to an unidentifed C.elegans gene. 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 9 - 25 of the amino acid sequence referenced in 
Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 1 to 8 
of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type II 
membrane proteins. 

In another embodiment, polypeptides comprising die amino acid sequence of 
the open reading frame upstream of the predicted signal peptide arc contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: PSFRRERVETGGGGPVTIIGTEGPFLPLPGGTRM 
NMTQARVLVAAVVGLVAVLLYASIHKJEEGHLAVYYRGGALLTSPSGPGYH 
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IMLPFTITFRSVQTTLQTDEVKNVPCGTSGGVMIYIDRIEVVNMLAPYAVFDIV 
RNYTADYDCTLIFNKLIHHELNQrc 

NLMAPGLTIQAVRVTKPKIPEAIRRNFELMEAEKTKLLIAAQKQKVVEKEA 
ETERKKAVIEAEKIAQVAKJRFQQKVMEKETEKRISEIEDAAFLAREKAK^ 
DAEYYAAHK^ATSNKHKLTPEYLELKKYQAIASNSKIYFGSNIPNMFVDSSC 
ALKYSD IRTGRESS LPS KE ALEPSGEN VIQNKESTG (SEQ ID NO: 346). 
Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 

This gene is expressed primarily in fetal tissue, including 8 week whole 
embryo, fetal liver spleen, nine week old early stage human, fetal heartfetal liver, 
fetal lung, and placenta and to a lesser extent in a variety of cancers, and other normal 
tissues. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and diseases of fetal development. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the fetal tissues, especially the 
liver, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., developmental, hepatic, immune, 
hematopoietic, pulmonary, cardiovascular, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 198 as residues: Leu-68 to Lys-74, Tyr-109 to Lys- 
115, Gln-200 to Val-205, Lys-207 to Lys-214, Glu-237 to Ile-244, Ala-271 to Thr- 
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279, Ser-317 to Ser-329, pin- 342 to Gly-348. Polynucleotides encoding said 
polypeptides are also provided. 

The tissue distribution of this gene (primarily fetal tissue and cancerous tissue, 
both of which are undergoing rapid growth) indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for treatment and diagnosis of 
cancer and disorders of fetal development Moreover, the expression within fetal 
tissue and other cellular sources marked by proliferating cells indicates this protein 
may play a role in the regulation of cellular division, and may show utility in the 
diagnosis, treatment, and/or prevention of developmental diseases and disorders 
including cancer, and other proliferative conditions. Representative uses are described 
in the "Hyperproiiferative Disorders " and "Regeneration" .sections below and 
elsewhere herein. Briefly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SM A). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or pro life rati ve conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
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antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:85 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1474 of SEQ ID NO:85, b is an 
integer of 1 5 to 1488, where both a and b correspond to the positions of nucleotide ' 
residues shown in SEQ ID NO:85, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

When tested against U937 and Jurkat cell lines, supematants removed from 
cells containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid and T-cetls, and to a lesser 
extent in other immune cells and tissue cell types, through the JAK-STAT signal 
transduction pathway. GAS is a promoter element found upstream of many genes 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 
element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: WSTGNASWEKKDNFILSADFEMMGLGNGRR 
SMKSPPLVLAALVACQVLGFNYWIASSRSVDLQTRIMELEGRVRRRAAERG 
AVELKKiNEFQGELEKQREQLDKIQSSPINFQLESVNKLYQDEKAVLVNNITTG 
ERLIR VLQ DQLKTLQRN YG RLQQ D VLQFQKNQTNLER KFS YD LSQCLNQM KE 
VKEQCEERICEVTKKGNEAVASRDLSENNDQRQQLQALSEPQPRLQAAGL 
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PHTEVPQGKGN VLGNS KSQTP APS S E V VLDS KRQ V EKEETNEIQ V VNEE 
PQRDRLPQEPGREQVVEDRPVGGRGFGGAGELGQTPQVQAALXVSQENPE 
MEGPERDQLVIPDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQ 
AALAGNDRNIDVFNVE DQKRDTINLLDQREKRNHTL (SEQ ID NO: 347). 
Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
9. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 9. 

This gene is expressed primarily in human endometrial tumor and other 
tumors and to a lesser extent in a variety of other healthy adult and fetal tissues 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental diseases and/or disorders, particularly cancer and other 
proliferative disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the endometrial tissue, cervix and uterus, expression of 
this gene at significantly higher or lower levels is routinely detected in certain tissues 
or cell types (e.g., developmental, reproductive, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 199 as residues: Asn-6 to Lys- 12, Leu-65 to Phe-70, 
Glu-73 to His-88, Gin- 123 to Gin- 135, Gin- 1 42 to Leu- 156, Arg-173 to GIy-181, 
Asp-189 to Gln-199, Scr-204 to Arg-209, Glu-219 to Gly-225, Gly-229 to Pro-238, 
Ser-246 to Asn-256, Glu-263 to Arg-276. Polynucleotides encoding said polypeptides 
are also provided. 

The tissue distribution in endometrial tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
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endometrial, cervical and uterine cancer Because of potential roles in proliferation 
and differentiation, this gene product may have applications in the adult for tissue 
regeneration and the treatment of cancers. It may also act as a morphogen to control 
cell and tissue type specification. Therefore, the polynucleotides and polypeptides of 
the present invention are useful in treating, delecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and is useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
The protein is useful in modulating the immune response to aberrant polypeptides, as 
may exist in proliferating and cancerous cells and tissues. The protein can also be 
used to gain new insight into the regulation of cellular growth and proliferation. ' 
Furthermore, the protein may also.be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate Iigands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 86 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3 160 of SEQ ID NO:86, b is an 
integer of 15 to 3 174, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:86, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

The translation product of this gene shares sequence homology with protein 
disulfide isomerase from Acanthamoeba castellan ii (See Genbank Locus 
ACAD1SPROA accession L28 174, genpep locus 4560! 3) which is thought to be 
important in converting proteins into their native conformations. The protein product 
of this gene was also shown to have homology to a phospholipase C homoiogue 
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derived from a mast cell cDNA library (See Geneseq Accession No. R9941 1). All 
references and information available through these accessions are hereby incorporated 
by reference herein - for example, Gene 150(1), 175-179(1994). 

Included in this invention as preferred domains are endoplasmic reticulum 
targeting sequence domain and the thioredoxin family active site domain, which were 
identified using the ProSite analysts tool (Swiss Institute of Bioinformatics). Proteins 
that permanently reside in the lumen of the endoplasmic reticulum (ER) seem to be 
distinguished from newly synthesized secretory proteins by the presence of the C- 
terminal sequence Lys-^sp-Glu-Leu (FCDEL) [ 1 ,2Y While KDEL is the preferred 
signal in many species, variants of that signal are used by different species. This • 
situation is described in the following table. 

Signal Species 

Vertebrates, Drosophila, Caenorhabditis e ley ana, plants 
Saccharomyces cerevisiae, Kluyveromyces lactis, plants 
Kluyveromyces lactis 

Schizosaccharomyces porabe (fission yeast) 
Plasmodium falciparum 

The signal is usually very strictly conserved in major ER proteins but some 
minor ER proteins have divergent sequences (probably because efficient retention of 
these proteins is not crucial to the cell). Proteins bearing the KDEL-type signal are not 
simply held in the ER, but are selectively retrieved from a post-ER compartment by a 
receptor and returned to dieir normal location. The concensus pattern is as follows: 
[KRHQSA|-|DENQ]-E-L>. Thioredoxins are small proteins of approximately one 
hundred amino- acid residues which participate in various redox reactions via the 
reversible oxidation of an active center disulfide bond. They exist in either a reduced 
form or an oxidized form where the two cysteine residues are linked in an 
intramolecular disulfide bond. Thioredoxin is present in prokaryotes and cukaryotes 
and the sequence around die rcdox-active disulfide bond is well conserved. 
Bacteriophage T4 also encodes for a thioredoxin but its primary structure is not 
homologous to bacterial, plant and vertebrate thioredoxins. A number of eukaryoric 



KDEL 
HDEL 
DDEL 
ADEL 
SDEL 
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proteins contain domains evolutionary related to thioredoxin, all of them seem to be 
protein disulphide isomerases (PDI). PDI (EC 5.3.4.1) is an endoplasmic reticulum 
enzyme that catalyzes the rearrangement of disulfide bonds in various proteins. The 
various forms of PDI which are currently known are: - PDI major isozyme; a 
multifunctional protein that also function as the beta subunit of prolyl 4-hydroxylase 
(EC 1 . 14. 1 1 .2), as a component of oligosaccharyl transferase (EC 2.4. 1 . 1 19), as 
thyroxine deiodinase (EC 3.8. 1.4), as glutathione-insulin transhydrogenase (EC 
1.8.4.2) and as a thyroid hormone-binding protein - ERp60 (ER-60; 58 Kd 
microsomal protein). ERp60 was originally thought to be a phosphoinositide-specific 
phospholipase C isozyme and later to be a protease. - ERp72. - P5. All PDI contains 
two or three (ERp72) copies of the thioredoxin domain. The concensus pattern is as ' 
follows: [LIVMFJ-[LIVMSTAJ-x-[LIVMFYCJ-[FYWSTHE]-x(2)-[FYWGTN]-C- 
[GATPLVEj-[PH YWSTAl-C-x(6)-[LIVMFYWT]. The two Cs form the redox- 
active bond. 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: SLHRFVLSQAKDEL (SEQ ID NO: 348), FIKFFAPWCGHCKALAPTW 
(SEQ ID NO: 349), and/or FIKFYAPWCGHCKTLAPTW (SEQ ID NO: 350). 
Polynucleotides encoding these polypeptides are also provided. 

Further preferred are polypeptides comprising the endoplasmic reticulum 
targeting sequence domain and thioredoxin family active site domain of the sequence 
referenced in Table for this gene, and at least 5, 10, 15, 20, 25, 30, 50, or 75 
additional contiguous amino acid residues of this referenced sequence. The additional 
contiguous amino acid residues is N- terminal or C- terminal to the endoplasmic 
reticulum targeting sequence domain and thioredoxin family active site domain. 
Alternatively, the additional contiguous amino acid residues is both N-terminal and 
C-terminal to the endoplasmic reticulum targeting sequence domain and thioredoxin 
family active site domain, wherein the total N- and C-terminal contiguous amino acid 
residues equal the specified number. Based on the sequence similarity, the translation 
product of this gene is expected to share at least some biological activities with 
thioredoxin proteins. Such activities are known in the art. some of which are 
described elsewhere herein. 



WO 00/06698 



195 



PCT/US99/17130 



In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: RRGRGVPGPRGRRRLWSAACGHCQRLQPTWN 
DLGDKYNSMEXAKVYVAKVDCTAHSDVCSAQGVRGYPTLKLFKPGQEAV 
KY<^PRDFQTLErWMLQTI J NEEPVTPEPEVEPPSAPELKQGLYEL^ASNFELH 
VAQGDHHKFFAPWCGHCKALAPTWEQLALGLEHSETVKIGKVDCTQHY 
ELCSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGA 
TETVTPSEAPVLAAEPEADKGTVLALTENNFDDTIAEGnTIKFYAPWCGHC 
KTL APTWEELSKKEFPGL AG V KI AE VDCT AERNIGS K YS VRG YPTLLLFRGGK 
KVSEHSGGRDLDS LHRFVLSQAKDEL (SEQ ID NO: 351). Polynucleotides - 
encoding these polypeptides are also provided. 

This gene is expressed primarily in human chondrosarcoma and endothelial 
cells and to a lesser extent in a wide range of normal and diseased adult and fetal 
tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, chondrosarcoma and other cancers and proliferative disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., vascular, skeletal, 
developmental, and cancerous and wounded [issues) or bodily fluids (e.g., serum, 
plasma, urine, amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in chondrosarcoma, combined with the homology to 
protein disulfide isomerase and phospholipase C indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
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chondrosarcoma and other cancers and proliferative disorders, and possibly as a 
reagent for in vitro production of proteins. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also he used to gain new 
insight into the regulation of cellular growth and proliferation. Moreover, the 
expression in endothelial cells indicates the protein is useful in the detection, 
treatment, and/or prevention of a variety of vascular disorders and conditions, which 
include, hut are not limited to miscrovascular disease, vascular leak syndrome, 
aneurysm, stroke, embolism, thrombosis, coronary artery disease, arteriosclerosis, 
and/or atherosclerosis. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible tlirough sequence databases. Some of these sequences are 
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related to SEQ DD NO: 87 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2766 of SEQ ID NO: 87, b is an 
integer of 15 to 2780, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:87, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 
This gene is expressed primarily in thyroid and thymus 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, thyroid diseases including thyroid cancer and diseases of function 
including Grave's Disease, hyper- and hypo- thyroidism as well a's Diseases of the 
thymus. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine and immune systems, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., endocrine, immune, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in thyroid cells and tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 
and treatment of diseases of the thyroid and thymus. Representative uses arc 
described in the "Biological Activity", "Hyperproliferative Disorders", and "Binding 
Activity" sections below, in Example II. 17, 18, 19, 20 and 27 T and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
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Addison's Disease, Cushing's Syndrome, and disorders and/or cancers of the 
pancrease (e.g. diabetes mellkus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper- 
,hypoparathyroidism) , hypothalamus, and testes. Furthermore, the protein may also 
be used to determine biological activity, to raise antibodies, as tissue markets, to 
isolate cognate ligands or receptors, to identify agents that modulate their interactions, 
in addition to its use as a nutritional supplement. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 88 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between i to 1047 of SEQ ID NO:88, b is an 
integer of 15 to 1061, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:88, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

The translation product of this gene shares sequence homology with collagen 
which is thought to be important as a structural material in a variety of human tissues 
and products including hair, nails, muscle and bone. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQK 
AHSGRGRWTChMECAYKGQQECLVETGALGPMAFRVHLGSQVGMDSKEK 
RGNV (SEQ ID NO: 352). Polynucleotides encoding these polypeptides are also 
provided. 

This gene is expressed primarily in smooth muscle and to a lesser extent in 12 
week old early stage human, epdidymus, healing groin wound 1 , synovial hypoxia, 
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stromal cells, ulcerative colitis, breast and 8 week old embryo, as well as a variety of 
other normal and diseased cell types from adult and fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer and other proliferative disorders as well a's Diseases of smooth 
muscle. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the muscular system, expression of this gene at significantly higher or 
lower levels is routinely detected in certain tissues or cell types (e.g., vascular, 
developmental, reproductive, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid aad spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 202 as residues: Glu-32 to Glu-46, Pro-63 to AIa-71, 
Pro-81 to Lys-90, Ser-97 to Trp-1 II, Lys-130 to Ser-135, Leu-147 to Cys-154, Asp- 
179 to Asn-186, Ser-219 to Gly-229. Polynucleotides encoding said polypeptides are 
also provided. 

The tissue distribution in smooth muscle and homology to collagen indicates 
that polynucleotides and polypeptides corresponding to this gene are useful for 
trearment and diagnosis of diseases of vascular diseases and/or disorders. 
Representative uses are described in the "Biological Activity", "Hypcrproliferative 
Disorders", "infectious disease", and "Regeneration" sections below, in Example 11, 
19, and 20, and elsewhere herein. Briefly, the protein is useful in detecting, treating, 
and/or preventing congenital disorders (i.e. nevi, moles, freckles, Mongolian spots, 
hemangiomas, port-wine syndrome), integumentary tumors (i.e. keratoses, Bowen's 
Disease, basal cell carcinoma, squamous cell carcinoma, malignant melanoma, 
Paget's Disease, mycosis fungoides, and Kaposi's sarcoma), injuries and inflammation 
of the skin (i.e. wounds, rashes, prickly heat disorder, psoriasis, dermatitis), 
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atherosclerosis, uticaria, eczema, photosensitivity, autoimmune disorders (i.e. lupus 
erythematosus, vitiligo, dcrmatomyosttis. morphea, scleroderma, pemphigoid, and 
pemphigus), keloids, striae, erythema, petechiae, purpura, and xanthelasma. In 
addition, such disorders may predispose increased susceptibility to viral and bacterial 
infections of the skin (i.e. cold sores, warts, chickenpox, molluscum contagiosum, 
herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, akhletes foot, and 
ringworm). 

Moreover, the protein product of this gene may also be useful for the 
treatment or diagnosis of various connective tissue disorders" (i.e., arthritis, trauma, 
tendonitis, chrondomalacia and inflammation, etc.), autoimmune disorders (i.e., 
rheumatoid arthritis, lupus, scleroderma, dermatomyositis, etc.), dwarfism, spinal ' 
deformation, joint abnormalities, amd chondrodysplasias (i.e. spondyloepiphyseal 
dysplasia congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Moreover, the protein is useful in the detection, 
treatment, and/or prevention of a variety of vascular disorders and conditions, which 
include, but are not limited to microvascular disease, vascular leak syndrome, 
aneurysm, stroke, embolism, thrombosis, coronary artery disease, arteriosclerosis, 
and/or atherosclerosis. Furthermore, the protein may also be used to determine 
biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, in addition to its use as a 
nutritional supplement. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:89 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between i to 1328 of SEQ ID NO:89, b is an 
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integer of 15 to 1342, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 89, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

This gene is expressed primarily in immune cells and to a lesser extent in a 
wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, T cell or B cell leukemia and various immunodeficiencies. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing • 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 203 as residues: GIy-3 to Gln-9. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in immune cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immune system diseases such as immunodeficiencies and T cell and/or B cell 
leukemia. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 1 1, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including hlood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). 
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Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as ADOS, leukemia, rheumatoid arthritis, granuJomatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-ccll mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood Lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate Iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:90 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 756 of SEQ ID NO:90, b is an 
integer of 15 to 770, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:90, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

The translation product of this gene shares sequence homology with IgE 
receptor. See for example, Isolation and Characterization of cDNAs coding for the 
Beta Subunit of the High-affinity Receptor for Immunoglobulin E, Proc. Natl. Acad. 
Sci. USA. (1988 Sep.) 85(17): 6483-6487. Based on the sequence similarity, the 
translation product of this gene is expected to share at least some biological activities 
with IgE receptor proteins. Such activities are known in the art, some of which are 
described elsewhere herein. IgE and its receptors are believed to have evolved as a 
mechanism to protect mammals against parasites. But other and intrinsically 
innocuous antigens can subvert this system to provoke an allergic response. For 
human populations in industrialized countries, allergy and asthma now represent a far 
greater threat than parasitic infection, and the main impetus for current studies of the 
IgE system is the hope of understanding and intervening in the aetiology of allergic 
diseases. The high-affinity receptor for immunoglobulin (Ig) E (Fc epsilon RI) on 
mast cells and basophils plays a key role in IgE- mediated allergies. Fc epsilon RI is 
composed of one alpha, one beta, and two gamma chains, which are all required for 
cell surface expression of Fc epsilon RI, but only the alpha chain is involved in the 
binding to IgE. Fc epsilon RI-IgE interaction is highly species specific, and rodent Fc 
epsilon RI does not bind human IgE. New homolog can be used to develop ant- 
allergic agents. FcR deliver signals when they arc aggregated at the cell surface. The 
aggregation of FcR having immunorecepior tyros ine-based activation motifs (ITAMs) 
activates sequentially sre family tyrosine kinases and syk family tyrosine kinases that 
connect transduced signals to common activation pathways shared with other 
receptors. FcR with ITAMs elicit cell activation, endocytosis, and phagocytosis. The 
nature of responses depends primarily on the cell type. The aggregation of FcR 
without IT AM does not trigger cell activation. Most of these FcR internalize their 
ligands, which can be endocy tosed, phagocytosed, or transcytosed. The fate of 
internalized rcccptor-ligand complexes depends on defined sequences in the 
intracytoplasmic domain of the receptors. The coaggregation of different FcR results 
in positive or negative cooperation. Some FcR without ITAM use FcR with ITAM as 
signal transduction subunits. The coaggregation of antigen receptors or of FcR having 
ITAMs with FcR having immunorecepior tyrosine-based inhibition motifs (ITIMs) 
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negatively regulates cell activation. FcR therefore appear as the subunits of 
multichain receptors whose constitution is not predetermined and which deliver 
adaptative messages as a function of the environment. 

The polypeptide of this gene has been determined to have four transmembrane 
domains at about amino acid position 51 - 67, 89 - 105, 1 19 - 135, and 190 - 206 of 
the amino acid sequence referenced in Table 1 for this gene. Based upon these 
characteristics, it is believed that the protein product of this gene shares structural 
features to type Ola membrane proteins. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the ' 
following amino acid sequence: ETRVKTSLELLRTQLEPTGT VGNTIMTS QPVPN 
CTIIVLPSNVINFSQAEKPEPTNQGQDSLK^ 

GOLASASFSPNfT-QVTSTLLNSAYPFIGPfWIISGSLSIATEKRLTKLLVHSSLV 
GSILSAL^ALVGHIIJSVKQATLNPASLQCELDK^ 

ix:ytakaslagxlslmuctllefclavltavlhwkqaysdfpgsvlflph 

S YIGNSGMSS KMTtlDCG YEELLTS (SEQ ID NO: 353). Polynucleoudes encoding 
these polypeptides are also provided. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MMVLSLGIILASASFSPNFTQVTSTLLNSAYPFIGPFFFI 
ISGSLSIATEKJa.TKLLVHSSLVGSIL^ALSALVGnn^VKQATLNPASLQC 
EU)KNNu^SYVSYFYHDSL\TTDCYTAKASLAGXLSLMLlCTLLEFCL 
AVLTAVLRWKQAYSDFPGSVLFLPHSYIGNSGMSSKMTHDCGYEELLTS 
(SEQ ID NO: 354). Polynucleotides encoding these polypeptides are also provided. 

The gene encoding die disclosed cDNA is believed to reside on chromosome 
L Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1 . 

This gene is expressed primarily in immune system tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell rype(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune system diseases and/or disorders such as cancer. Similarly, 
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polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or eel! type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 204 as residues: Gln-23 to Lys-39 : Glu-150 to Thr- 
158. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in immune cells and tissues combined with the 
homology to IgE receptor indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of immune system 
disorders. Representative uses are described in the "Immune Activity" and "infectious 
disease" sections below, in Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere 
herein. Briefly, the expression of this gene product indicates a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 
lineages, including blood stem cells. This gene product is involved in the regulation of 
cytokine production, antigen presentation, or other processes suggesting a usefulness 
in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-ccll mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versusrgraft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 



WO 00/06698 



206 



PCT/US99/I7130 



influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. The gene product may 
also be involved in lymphopoiesis, therefore, it can be used in immune disorders such 
as infection, inflammation, allergy, immunodeficiency etc. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Furthermore, the protein may also be used to determine biological 
activity, raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional- 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:91 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1 556 of SEQ ID NO:9 1. b is an 
integer of 15 to 1570, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:9i, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BV GENE NO: 82 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GASCEGGGAAARAALGVHRSQKALLVFRRTL 
SNLLYMPLLRGLLWLQVLCAGPLHTEAVVLLVPSDDGRAFLLRSRLLHPEAH 
VPPAADRGASLQCVLHQAAPKSRPRSPAAGAALLHXPRRTGDEPCREFHGN 
GFPGPTQLTPGECGLPAPSSLLQHASAPVRTGSEGQVVGCPRARGETGEGLSL 
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AFLSSLMFTSRNGLVGC GASCEGGGAAARAALGVHRSQKALLVFRRTLSNL 
LYMPLLRGLLWLQVLCAGPLHTEAVVLLVPSDDGRAHJLRSRLLHPEAHVPP 
AAD RGASLQCVLHQAAPKSRPRSPAAGAALLHXPRRTGDEPCREFHGNGFP 
GPTQLTPGECGLPAPS S LLQHAS APVRTGSEGQ WGCPR ARGETGEGLSLA 
FLSS1MFTSRNGLVGC (SEQ ID NO: 355). Polynucleotides encoding these 
polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
7. Accordingly, polynucleotides related tn this invention are useful as a marker in 

linkage analysis for chromosome 7. 

4' 

This gene is expressed primarily in activated T cells, and to a lesser extent in a 
wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly 
immunodeficeincies. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 205 as residues: Pro-67 to Ser-73. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in activated T cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immunodeficiencies. Representative uses are described in the "Immune Activity" and 
"infectious disease" sections below, in Example 11,13, 14, 16, 18, 19, 20, and 27, and 



WO 00/06698 



208 



PCT/US99/17130 



elsewhere herein. Briefly, the expression of this gene product indicates a role in 
regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product is involved 
in the regulation of cytokine production, antigen presentation, or other processes 
suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatou's Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft- versus- host diseases, or autoimmunity 
disorders, such as autoimmune infertility, iense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:92 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between I to 2936 of SEQ ID NO:92, b is an 
integer of 1 5 to 2950, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:92, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

The translation product of this gene was shown to have homology to the 
human transmembrane protein (See Genbank Accession No. gblAAC51364.1l 
(AF000959); all references and information available through this accession are 
hereby incorporated by reference herein; for example, Genomics 42 (2), 245-25 1 
(1997)) which is thought to be implicated in velo-cardio-facial syndrome. 

A preferred polypeptide fragment of the invention comprises the following 
amino acid sequence: MGSAALEILGLVLCLVGWGGLlLACGLPIVfWQVTAFLD 
HNIVTAQTI^KGLWMSCVVQSTGTCSAKCTTRCWL (SEQ ID NO: 356). 
Polynucleotides encoding these polypeptides are also provided. 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
22. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysts for chromosome 22. 

This gene is expressed primarily in dementia brain tissue, and to a lesser 
extent in a wide variety of human tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural diseases and/or disorders, particularly dementia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell lype(s). For 
a number of disorders of the above tissues or cells, particularly of the central nervous 
system, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., neural, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 206 as residues: Ser-201 to Tyr-217. Polynucleotides 
encoding said polypeptides are also provided. 

Tbe tissue distribution in dementia brain tissue, combined with the homology 
to the transmembrane protein indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of dementia, and 
potentially for velo-cardio-facial syndrome. Representative uses are described in the 
"Regeneration" and "Hyperproliferative Disorders" sections below, in Example 11, 
15, and 1 8, and elsewhere herein. Briefly, the uses include, but arc not limited to the 
detection, treatment, and/or prevention of Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating ' 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain indicates it plays a role in 
normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as ES T sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:93 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 



WO 00/06698 



211 



PCT/US99/17130 



formula of a-b, where a is any integer between 1 to 1708 of SEQ ID NO:93, b is an 
integer of 15 to 1722, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:93, and where b is greater than or equal to a + 14. 

JaATORjES OF PROTEIN ENCODED BY GENE NO: 84 

In another embodiment, polypeptides comprising the amino acid sequence of 
ihe open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: LKRAPPGPALAKCLLQPSSTFQALETNIGDQVR 
RHSTAVVIREVn*SYILISFVLXIGVGCffi 

VRMLRGELSCACYRPHVQALQLGGCTCF (SEQ TO NO: 357). Polynucleotides 
encoding these polypeptides are also provided. 

This gene is expressed primarily in the adult pulmonary system. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but arc 
not limited to, cystic fibrosis, bronchitis and any pulmonary disorders in general. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissues) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
pulmonary system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., pulmonary, cardiovascular, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
pulmonary surfactant, pulmonary lavage/sputum, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

The tissue distribution of this gene only in the pulmonary system indicates that 
it plays a key role in the functioning of the pulmonary system. This would suggest 
that misregulation of the expression of this protein product in the adult could lead to 
lymphoma or sarcoma formation, particularly in the lung and the protein product 
could be used either in the treatment and/or detection of these disease states. The *ene 
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or gene product may also useful in the treatment and/or detection of pulmonary 
defects such as pulmonary edema and embolism, bronchitis and cystic fibrosis. 
Furthermore, the protein may also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate Iigands or receptors, to identify agents 
that modulate their interactions, in addition to its use as a nutritional supplement. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:94 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 62 1 of SEQ ID NO:94, b is an 
integer of 15 to 635, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:94, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this gene was found to be homologous to CAM 
proteins. Based on the sequence similarity, the translation product of this gene is 
expected to share at least some biological activities with CAM proteins. Such 
activities are known in the art, some of which are described elsewhere herein. 

A preferred polypeptide varient of the invention comprises the following 
amino acid sequence: MLCP WRTANLGLLLILTIFL V AEAEG A AQPN NSLM 
LQTSKENHALASSSLCMDEKQITQNYSKVLAEVNTSWPVKMATNAVLC 
CPPIALRNLUITWEIILRGQPSCnCAYKKETNETKETNCT 

NSDLQIRTVAITHDGYYRCIMVTPDGNFHRGYHLQVLVTPEVTLFQNRNRTA 
VCKAVAGKPAAHISWIPEGDCATKQEYWSNGTVTVKSTCHWEVHNVSTV 
NCHVSHLTGNKSLYIELLPVPGAKKSSKLYIPYIILTIIILTIVGXWLLKVNG 
CXKYKLNKPESTPVVEEDEMQPYAFYTEKNNPLXXTTNKVKASEALQSEV 
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DTDLHTL (SEQ ID NO:208). Polynucleotides encoding these polypeptides are also 
provided. 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 271 - 287 of the amino acid sequence referenced 
in Table 1 for this gene. Moreover, a cytoplasmic tail encompassing amino acids 288 
to 548 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type la 
membrane proteins. 

This gene is expressed primarily in dendritic cells. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immunodeficiency, tumor necrosis, infection, lymphomas, auto- 
immunities, cancer, metastasis, wound healing, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels is routinely detected in certain tissues or cell 
types (e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 208 as residues: Asp-53 to Tyr-6l, Pro- 105 to Ile- 
128, Arg-133 to Leu-140, Gln-182 to Ala-188, Pro-205 to Asn-218, Gly-259 to Ala- 
264, Asn-290 to Ser-302, Glu-307 to Tyr-3 14, Ty r-3 1 7 to Lys-332. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in dendritic cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
immune disorders including; leukemias, lymphomas, auto-immunities, 
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immunodeficiencies (e.g. AIDS), immuno-supressive conditions (transplantation) and 
hematopocitic disorders. In addition this gene product is applicable in conditions of 
general microbial infection, inflammation creancer. Representative uses axe 
described in the "Immune Activity" and "infectious disease" sections below, in 
Example 11, 13, 14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the 
expression of this gene product indicates a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product is involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene ' 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to deterrnine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:95 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 3784 of SEQ ID NO:95, b is an 
integer of 15 to 3798, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:95, and where b is greater than or equal to a + 14. 

FEATURES OF PROJEIN ENCODED BY GENE NO: 86 

In another embodiment, polypeptides comprising the amino acid sequence.of 
the open reading frame upstream of the predicted signal peptide arc contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: VDCLICPAAFPVYFQDiMARGCVCSLCASVCIFLS 
SLFPLLPSVHSVNIISCLLLS1CCFEGLELMCEHL YQLSQLHVLHHIFSYLLCTP 
(SEQ ID NO: 358). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in embryonic tissue and to a lesser extent in 
in a variety of other tissues and cell types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental anomalies, fetal deficiencies, cancer and neoplastic 
states. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or 
ceil type(s). For a number of disorders of the above tissues or cells, particularly of the 
developing fetus, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or cell types (e.g., developmental, differentiating, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
amniotic fluid, synovial fluid and spinal fluid) or another tissue or cell sample taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 
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The tissue distribution in embryonic tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
developmental anomalies, fetal deficiencies and pre-natal disorders, as well as 
abnormal cell proliferation and/or differentiation, neoplastic states and cancer. 
Moreover, the expression within embryonic tissue and other cellular sources marked 
by proliferating ceils indicates this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis, treatment, and/or prevention of 
developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern ' 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:% and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2669 of SEQ ID NO:96, b is an 
integer of 15 to 2683, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:96, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED B V GENE NO: 87 

The translation product of this gene shares sequence homology with inter- 
alpha-trypsin inhibitor which is thought to be important in inhibition of trypsin and 
other serine proteases (See Genbank Accession No. pirlS30350IS3O35O; all references 
and information available through this accession are hereby incorporated herein by 
reference; for example. Eur. J. Biochem. 179 (1), 147-154 ( 1989), J. Biol. Chem. 264 
(27), 15975-15981 (1989), and J. Biol. Chem. 266 (2), 747-751 (1991)). 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP- 1 ceils to 
calcium. Thus it is likely that the product of this gene is involved in a signal 
transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both monocytes, in addition to other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but are 
not limited to, activating monocytes, and to a lesser extent, other immune and/or 
hematopoietic cells. Binding of a ligand to a receptor is known to alter intracellular 
levels of small molecules, such as calcium, potassium and sodium, as well as alter pH 
and membrane potential. Alterations in small molecule concentration caa be measured 
to identify supernatants which bind to receptors of a particular cell. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
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following amino acid sequence: YXIPGSTHASGRQRGSGRGEDDSGPPPSTVINQ 

NETFANIIFKPTVVQQARLAQNGILGDFIIRYDVNREQSIGDIQVLNGYFVHY^ 

APKDLPPLPKNVVFVLDSSASMVGTKLRQTKDALFTILHDLRPQDRFSUGFS 

MRIKVWKI)HLISVrPDSIRDGKVYTHHMSPTGGTDINGVLQRAIRLUvfKY 

SGIGDRSVSLrVFLTDG KPTVGETHTLKILNNTREAARGQVCIFTIGIGNDVD 

FRIXEK^JUSNCGLTRRVHEEEDAGSQUGFYDEIRTPLLSDIRIDYPPSSVVQ 

ATKTLFPNYFNGSEinAGKLVDRKLDHLHVEVTASNSKKFIILKTDVPVRPQK 

AGKDVTGSPRPGGDGEGDXNHIERLWSYLTTKELLSSWLQSDDEPEKERLRQ 

RAQAl^VSYRH.TPn-SMKLRGPVPRMDGLEEAHGMSAAMGPEPVVQSVR 

GAGTQPGPLLKKPYQPRIKISKTSVDGnPHFVVDFPLSRLTVCFNIDGQPGpiL 

RLVSDHRDSGVTVNGELIGAPAPPNGHKKQRTYLRTrnLmKPERSYLErfPS 

RVILDGGDRLVLPCNQSVVVGSWGLEVSVSANANVTVTIQGSIAFVILIHLYK 

KPAPFQRHHLGFYIANSEGLSSNCHGLLGQFLNQDARLTEDPAGPSQNLTHP 

LLLQVGEGPEAVLTVKGHQVPVVWKQRKIYN GEEQXDCWFARNMPPN 

(SEQ ID NO: 359). Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in placenta and adipose tissue and to a lesser 
extent in several other organs and tissues including cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differentia] identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of developing organs and metabolic diseases, in addition to 
vascular diseases and conditions. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the developing systems and metabolic systems, 
expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., reproductive, vascular, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative 
to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 
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Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 210 as residues: Lys-5 to Lys-10, Asn-33 to Lys-39, 
Asp-48 to Lys-54. Pro-62 to Asp-67, Asn-l 16 to Arg-123, His-157 to Ala-162, Val- 
242 to Lys-249, Val-251 to Asp-264. Polynucleotides encoding said polypeptides are 
also provided. 

The tissue distribution in placenta, combined with the homology to inter- 
alpha-trypsin inhibitor and the detected calcium flux biological acttivity indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment 
and diagnosis of disorders of developing and metabolic systems. This protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis, 
treatment, and/or prevention of developmental diseases and disorders, including ' 
cancer, and other proliferative conditions. Representative uses are described in the 
"Hyperproliferative Disorders" and "Regeneration" sections below and elsewhere 
herein. Briefly, developmental tissues rely on decisions involving cell differentiation 
and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
cf eel! death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and h useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Moreover, the protein 
is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but arc not limited to miscro vascular disease, 
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vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Polynucleotides and polypeptides of 
10 ^ invention are also useful for the treatment, detection, and/or prevention of 

inflammation, tumor invasion and metastasis, wound healing, liver disease, 
5 disseminated intravascular coagulation, alzheimer's Disease, ophthalmic disease, 
apoptosis, tissue remodeling, intrauterine growth retardation, preeclampsia, 
angiogenesis, cell migration, fetal development, trophoblast implantation, ovulation, 
pemphigus and psoriasis, and antiviral therapy. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
2Q 10 ct >gnate Iigands or receptors, to identify agents that modulate their interactions, in 

addition to its use as a nutritional supplement. Protein, as well as, antibodies directed' 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 
25 Ma °y polynucleotide sequences, such as EST sequences, are publicly 

1 5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:97 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
20 more polynucleotides comprising a nucleotide sequence described by the general 
35 formula of a-b, where a is any integer between 1 to 2 167 of SEQ ID NO:97, b is an 

integer of 15 to 218 1, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:97, and where b is greater than or equal to a + 14. 

40 25 FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

The translation product of this gene was shown to have homology to the 
human colon carcinoma antigen NY-CO-7 (See Genbank and Gcneseq Accession 
45 Nos - fiblAACl 8038. II (AF039689) and WO9904265; all references available through 

this accession are hereby incorporated herein by reference; for example, Int. J. Cancer 
30 76(5), 652-658 (1998)). 

This gene is expressed primarily in breast and breast cancer and to a lesser 
50 extent in several other organs and tissues including cancers. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of reproductive organs and the gastrointestinal system, 
including cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of ihe above 
tissues or cells, particularly of the reproductive systems, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., gastrointestinal, reproductive, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, breast milk, chyme, hile, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 21 1 as residues: Gly-22 to Gly-28, Leu-7 1 to Phe-77, 
Asn-101 to Val-108, Pro-122 to Ser-127, Arg-149 to Pro-154, Gly-191 to Phe-196, 
Pro- 199 to Thr-2 1 1 . Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in breast and breast cancer tissue, combined with the 
homology to a colon cancer antigen indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of disorders of the 
reproductive systems and cancers. This protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis, treatment, and/or prevention 
of developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferativc Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving ceil differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
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potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, anc/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the ' 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:98 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1943 of SEQ ID NO:98, b is an 
integer of 15 to 1957, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:98, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

The translation product of this gene shares sequence homology with the amino 
acid and protein sequence of a Xenopus transmembrane protein of unknown function. 
The very 5 f -end of the contig is identical to the mRNA for the human LGN mosaic 
protein. Based on the sequence similarity, the translation product of this gene is 
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expected to share at least some biological activities with LGN mosaic proteins. Such 
activities are known in the art, some of which are described elsewhere herein- 
Preferred polypeptides of the invention comprise the following amino acid sequence: 
PR VRPPTKALA VTrTTFVTEPLKHIGKGTGEFIKALM KEIP ALLHLPVLUMAL 
A1LSFCYGAGKSVHVLRHIGGPEREPPQALRPRDRRRQEEIDYRPDGGAGDAD 
FHYRGQMGPTEQGPYAKTYEGRREILRERDVDLRFQTGNKSPEVLRAFDVPD 
AEAREHPTV VPS HKS P VLDTKPKETGGILGEGTPKESSTESS QS AKP VSGQDTS 
GNTEGSPA AEKAQLKSEA AGSPDQGSTYSPARG VAGPRGQDPVSSPCG (SEQ 
ID NO:339). Polynucleotides encoding such polypeptides are also provided. 

The gene encoding the disclosed cDN A is believed to reside on chromosome 
L Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1 . 

This gene is expressed primarily in small intestine and adipocytes and to a 
lesser extent in various other normal and transformed cell types, mostly of endocrine 
origin. 

Therefore, polynucleotides and polypeptides of the invention arc useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, conditions of growth and metabolism. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the lissue(s) or cell type(s). For a number of 
disorders of the above tissues or ceils, particularly of the digestive and endocrine 
systems, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., metabolic, gastrointestinal, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, bile, 
chyme, synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 212 as residues: Pro-40 to Gly-68, Gly-79 to Arg-93, 
Phe-106 toGlu-114, Pro- 122 to His-129, Thr-143 to GIy-149, Gly-155 to Aia~168, 
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Val-171 to GIy-182, Ala-195 to Pro-207, Pro-214 to Val-220. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in. small intestine indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for study and treatment of 
disorders of growth and metabolism as well as endocrine abnormalities. Furthermore, 
the protein may also be used to determine biological activity, to raise antibodies, as 
tissue markers, to isolate cognate ligands or receptors, to identify agents that modulate 
their interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:99 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:99, b is an 
integer of 15 to 1 1 12, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:99, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

The translation product of this gene shares sequence homology with IgE 
receptor beta chain which is thought to be important in immune function. 
This gene is expressed primarily in kidney medulla tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and renal diseases and/or disorders. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune and renal systems, 
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expression of this gene at significantly higher or lower levels is routinely detected in 
certain tissues or cell types (e.g., immune, renal, urogenital, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 
a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in kidney renal medulla tissue, combined with the 
homology to the IgE receptor beta chain indicates that polynucleotides and 
polypeptides corresponding to this gene arc useful for the treatment of immune and 
renal disorders. The protein product of this gene could be used in the treatment and/or 
detection of kidney diseases including renal failure, nephritus, renal tubular acidosis, 
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilm's Tumor Disease, and congenital kidney abnormalities such as horseshoe 
kidney, polycystic kidney, and Falconi's syndrome. Alternatively, this gene product is 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis; Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood ceils, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
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protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 100 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one' or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 873 of SEQ ID NO: 100, b is an 
integer of 15 to 887, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 100. and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

The translation product of this gene shares sequence homology with Diff 40 
gene product (See Genbank Accession No. gblAACSl 134. 1 ; all references and 
information available through this reference are hereby incorporated herein). 

Preferred polypeptides of the invention comprise the following amino acid 
sequence: PRVRSIKVTELKGLANHVVVGSVSCETKDLFAALPQVYAVDIN 
DLGTIKl^LEVWSPFDKDDQPSAASSVNKASTVTKRFSTYSQSPPDTPS 
LREQAFYNMLRRQEELENGTAWSLSSESSDDSSSPQLSGTARHSPAPRPLV 
QQPEPLPIQVAFRRPETPSSGPLDEEGAVAPVLANGHAPYSRTLSHISEASVNA 
ALAEASVEAVGPKSLSWGPSPFrHPAPTHGKHPSPVPPALDPGHSATSST 
LGTTGSVFTSTT>PAPSAHiJ)SVHK5TDSGPSELPGPTimTGSTYSAITTTHS 
APSPLTHTTTGSTHKPIISTLTTTGPTLNIIGPVQTTrSPTHTMPSPSSHSNSP 
YVDrcSSVCDNlFVHYVIGIrTOTLYSSKTL (SEQ ED NO:360), and/or PRVRS 
IKVTELKGLANHVVVGSVSCETKDLFAALPQVVAVDINDLGTIKLSLEVTWSP 
roKDDQPSAASSVNKASTVTKRFSTYSQSPPDTPSLREQAFYNMLRRQEELE 
NGTAWSLSSESSDDSSSPQLSGTARHSPAP RPLVQQPEPLPIQVAFRRPET 
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PSSGPLDEEGAVAPVLANGHAPYSRTLSHISEASVNAALAEASVEAVGPKSL 
SWGPSPPTHPAPTHGKHPSPVPPALDPGHSATSSTLGTTGS VPTSTD (SEQ ID 
NO: 361). Polynucleotides encoding these polypeptides are also provided. 
Polypeptides of the invention do not consist of the primary amino acid sequence 
shown as Geneseq Accession No.W69430, which is hereby incorporated herein by 
reference. 

This gene is expressed primarily in liver and to a lesser extent in gall bladder 

tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic and endocrine diseases and/or disorders, particularly hepatic 
and gal! bladder disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the metabolic and endocrine systems, expression of this 
gene at significantly higher or lower levels is routinely detected in certain tissues or 
cell types (e.g., hepatic, metabolic, gall bladder, gastrointestinal, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, bile, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 214 as residues: Val-9 to Cys-14, Pro^2 to Thr-47, 
Thr-56 to AIa-64. Asp-88 to His-98, Cys-128 to Ser-136, Arg-153 to Trp-161. 
Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in liver and gall bladder, combined with the homology 
to the diff 40 gene product indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the study and treatment of endocrine and 
metabolic disorders, polynucleotides and polypeptides corresponding to this gene are 
useful for the detection and treatment of liver disorders and cancers. Representative 
uses are described in the "Hyperproiiferative Disorders", "infectious disease", and 
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"Binding Activity" sections below, in Example 11, and 27, and elsewhere herein. 
Briefly, the protein can be used for the detection, treatment, and/or prevention of 
hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and conditions that are 
attributable to the differentiation of hepatocyte progenitor cells. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are ' 
related to SEQ ID NO: 101 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1234 of SEQ ID NO: 101, b is an 
integer of 15 to 1248, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 101. and where b is greater than or equal to a + 1 4. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The polypeptide of this gene has been determined to have a transmembrane 
domain at about amino acid position 3 - 19 of the amino acid sequence referenced in 
Table I for this gene. Based upon these characteristics, it is believed that the protein 
product of this gene shares structural features to type II membrane proteins. 

This gene is expressed primarily in fetal brain and to a lesser extent in 
pancreas tumor, melanocyte and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neural diseases and/or disorders, particularly neurodevelopmental 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
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useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nervous system, expression of this gene at significantly 
higher or lower levels is routinely detected in certain tissues or cell types (e.g., neural, 
developmental, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, amniotic fluid, urine, synovial fluid and spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in fetal brain tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of / 
developmental disorders of the central nervous system. Representative uses are 
described in the "Regeneration" and "Hyperproliferative Disorders" sections below, in 
Example 1 1, 15, and 18, and elsewhere herein. Briefly, the uses include, but are not 
limited to the detection, treatment, and/or prevention of Alzheimer's Disease, 
Parkinson's Disease, Huntington's Disease, Tourettc Syndrome, meningitis, 
encephalitis, demycltnating diseases, peripheral neuropathies, neoplasia, trauma, 
congenital malformations, spinal cord injuries, ischemia and infarction, aneurysms, 
hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive compulsive 
disorder, depression, panic disorder, learning disabilities, ALS, psychoses, autism, 
and altered behaviors, including disorders in feeding, sleep patterns, balance, and 
• perception. In addition, elevated expression of this gene product in regions of the 
brain indicates it plays a role in normal neural function. 

Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. Furthermore, the protein may also be used to determine biological activity, 
to raise antibodies, as tissue markers, to isolate cognate ligands or receptors, to 
identify agents that modulate their interactions, in addition to its use as a nutritional 
supplement. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 102 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one oi 
5 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1827 of SEQ ID NO: 102, b is an 
integer of 15 to 1841, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:102, and where b is greater than or equal to a + 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

The translation product of this gene shares sequence homology with a ' 
probable membrane protein YGL054c -yeast (Saccharomyccs cerevisiae). Moreover, 

The translation product of this gene also have homology to the human and 
mouse cornichon protein which is known to be necessary for both anterior-posterior 
and dorsal-ventral pattern formation in conjunction with the EGF receptor signaling 
process (See Genbank Accession Nos. gblAAC98388.1l (AF104398). and splP52l59; 
all references and information available through these accessions are hereby 
incorporated herein by reference; for example, Cell 8 1 (6). 967-978 ( 1 995)). 

The polypeptide of this gene has been determined to have two transmembrane 
domains at about amino acid position 57 - 73. and 121 - 137 of the amino acid 
sequence referenced in Table 1 for this gene. Moreover, a cytoplasmic tail 
encompassing amino acids 1 - 14 of this protein has also been determined. Based 
upon these characteristics, it is believed that the protein product of this gene shares 
structural features to type Ilia membrane proteins. 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: YGCEKTTEGG RRRRRRME A VVF VFSLLDCC AL 
IrT^\nTFini^DLECDYINARSCC5KLNKWVIPELIGHTIVTVLLLMSLHW 
IFLLNLPVATWNIYRYIMVPSGNMGWDPTBIHNRGQLK^HMKEAMIKLGFH 
LLC FFMYLYSMIL ALIND (SEQ ID NO:362). Polynucleotides encoding these 
polypeptides are also provided. 
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The gene encoding the disclosed cDNA is believed to reside on chromosome 
1. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1 . 

This gene is expressed primarily in activated T-cells and to a lesser extent in 
endometrial tumor, T cell helper II cells, microvascular endothelial cells, Raji cells 
treated with cyciohexamide and umbilical vein endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune, hematopoietic, and vascular diseases and/or disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels is 
routinely detected in certain tissues or ceil types (e.g.. immune, hematopoietic, 
vascular, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, amniotic fluid, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 216 as residues: Ser-39 to Asn-45. Asn-103 to Ser- 
109. Polynucleotides encoding said polypeptides are also provided. 

The tissue distribution in activated T-cells indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 
immune disorders involving activated T-cells. Representative uses are described in 
the "Immune Activity" and "infectious disease" sections below, in Example 1 1, 13, 
14, 16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival: differentiation; and/or 
activation of hematopoietic cell lineages, including blood stem cells. This gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
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other processes suggesting a usefulness in the treatment of cancer (e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyehnation, systemic 
lupus erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Moreover, the protein 
is useful in the detection, treatment, and/or prevention of a variety of vascular 
disorders and conditions, which include, but are not limited to miscrovascular disease, 
vascular leak syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery 
disease, arteriosclerosis, and/or atherosclerosis. Furthermore, the protein may also be 
used to determine biological activity, to raise antibodies, as tissue markers, to isolate 
cognate ligands or receptors, to identify agents that modulate their interactions, in 
addition to its use as a nutritional supplement. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences arc 
related to SEQ ID NO: 103 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides arc specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from die present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between 1 to 67 1 of SEQ ID NO: 103, b is an 
integer of 15 to 685, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 103, and where b is greater thaji or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: ARAPA PSLPPLPSPAPALAP AHSLLGLLLGRMS 
GSSLPSALALSLLLVSGSLLPGPGAAQNVRVQSGQDQ (SEQ ID NO: 363). 
Polynucleotides encoding these polypeptides are also provided. 

This gene is expressed primarily in dendritic celts and to a lesser extent in 
healing abdomen wound, and pancreas islet cell tumor cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents tor differential identification of the tissue(s) or cell typc(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic diseases and/or disorders, particularly 
wound healing disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 217 as residues: Gln-34 to Lys-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in dendritic cells and early healing wound indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for treating 
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wounds to enhance the healing process. Representative uses are described in the 
"Immune Activity" and "infectious disease" sections below, in Example 11, 13, 14, 
16, 18, 19, 20, and 27, and elsewhere herein. Briefly, the expression of this gene 
product indicates a role in regulating the proliferation; survival; differentiation; and/or 
activation of hematopoietic cell Uncages, including blood stem cells. Tliis gene 
product is involved in the regulation of cytokine production, antigen presentation, or 
other processes suggesting a usefulness in the treatment of cancer <e.g. by boosting 
immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product is involved in immune functions. Therefore it is also useful as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases sdeh 
as AIDS, leukemia, rheumatoid arthritis, granulomatous Disease, inflammatory 
bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, 
such as T-cell mediated cytotoxicity; immune reactions to transplanted organs and 
tissues, such as host-versus-graft and graft-versus-host diseases, or autoimmunity 
disorders, such as autoimmune infertility, lense tissue injury, demyelination, systemic 
lupus erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
Disease, and scleroderma. Moreover, the protein may represent a secreted factor that 
influences the differentiation or behavior of other blood cells, or that recruits 
hematopoietic cells to sites of injury. Thus, this gene product is thought to be useful in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Furthermore, the 
protein may also be used to determine biological activity, raise antibodies, as tissue 
markers, to isolate cognate iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 104 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
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cumbersome. Accordingly, preferably excluded from the present invention an; one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1 1 54 of SEQ ID NO: 104, b is an 
integer of 1 5 to 1 168, where both a and b correspond to the positions of nucleotide 
5 residues shown in SEQ ID NO: 104, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of aortic smooth muscle 
20 10 cells to calcium. Thus it is likely that the product of this gene is involved in a signal 

transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of both smooth muscle cells, and in other cell-lines or tissue 
cell types. Thus, polynucleotides and polypeptides have uses which include, but are 
25 not "'miled to, activating smooth muscle cells. Binding of a ligand to a receptor is 

15 known to alter intracellular levels of small molecules, such as calcium, potassium and 
sodium, as well as alter pH and membrane potential. Alterations in small molecule 
concentration can be measured to identify superoatants which bind to receptors of a 
particular cell. 

This gene is expressed primarily in pancreatic carcinoma, gall bladder and 
20 primary dendritic cells. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, metabolic and immune diseases and/or disorders, particularly cancers, 
such as pancreatic carcinoma and gall bladder tumor. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
45 disorders of the above tissues or cells, particularly of the immune system, expression 

of this gene at significantly higher or lower levels is routinely detected in certain 
30 tissues or cell types (e.g. , metabolic, immune, hematpoietic. and cancerous and . 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 
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a disorder, relative to the standard gene expression level, i.e.. the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 218 as residues: Lys-34 to Ile-41. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution in pancreatic carcinoma and gall bladder indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosing 
and treating cancer, such as pancreatic carcinoma and gall bladder tumors. 
Representative uses arc described here and elsewhere herein. Alternatively, the 
delected calcium flux biological activity indicates the protein is useful in the 
detection, treatment, and/or prevention of a variety of vascular disorders and 
conditions, which include, but are not limited to miscrovascular disease, vascular leak 
syndrome, aneurysm, stroke, embolism, thrombosis, coronary artery disease, 
arteriosclerosis, and/or atherosclerosis. Furthermore, die protein may also be used to 
determine biological activity, to raise antibodies, as tissue markers, to isolate cognate 
ligands or receptors, to identify agents that modulate their interactions, in addition to 
its use as a nutritional supplement. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 105 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1161 of SEQ ID NO: 105, b is an 
integer of 15 to 1 1 75, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 105, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

The polypeptide of this gene has been determined to have a transmembrane 
10 domain at about amino acid position 10 - 26 of the amino acid sequence referenced in 

Table t for this gene. Moreover, a cytoplasmic tail encompassing amino acids 27 to 
5 48 of this protein has also been determined. Based upon these characteristics, it is 
believed that the protein product of this gene shares structural features to type lb 
15 membrane proteins. 

This gene is expressed primarily in osteosarcoma, wilm's tumor, ovarian 
cancer and in T-cells. 

20 10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a ' 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, inflammatory diseases and cancers, such as osteosarcoma, wilm's 
25 tumor 81,(1 ovarian cancer. Similarly, polypeptides and antibodies directed to these 

15 polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of die immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., skeletal, renal, reproductive, immune, and cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
35 another tissue or cell sample taken from an individual having such a disorder, relative 

to the standard gene expression level, i.e., the expression level in healthy tissue or 
bodily fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO; 219 as residues: Ser-30 to Pro-35. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution indicates that polynucleotides and polypeptides 
45 corresponding to this gene are useful for diagnosis and treatment of inflammatory 

conditions and cancer, such as osteosarcoma, wilm's rumor and ovarian cancer. 
30 Moreover, the expression within cellular sources marked by prohferating cells 

indicates this protein may play a role in the regulation of cellular division, and may 
50 show utility in the diagnosis, treatment, and/or prevention of developmental diseases 
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and disorders, including cancer, and other proliferative conditions. Representative 
uses are described in the "Hyperproliferative Disorders" and "Regeneration" sections 
below and elsewhere herein. Briefly, developmental tissues rely on decisions 
involving cell differentiation and/or apoptosis in pattern formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as a morphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate Iigands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 106 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between I to 1007 of SEQ ID NO: 106, b is an 
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integer of 15 to 1021. where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 106, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

In another embodiment, polypeptides comprising the amino acid sequence of 
the open reading frame upstream of the predicted signal peptide are contemplated by 
the present invention. Specifically, polypeptides of the invention comprise the 
following amino acid sequence: GTSKDCVLYATLDPGMAVPLFLYIFTLLPLLPF 
LLSLCFSPLTVKRSSSSE^KSSL (SEQ ID NO: 364). Polynucleotides encoding 
these polypeptides are also provided. 

This gene is expressed primarily in ovarian cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissuc(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, ovarian cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels is routinely detected in certain tissues or cell types 
(e.g., reproductive, ovarian, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial amniotic fluid, fluid or spinal fluid) or another tissue 
or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 220 as residues: Thr-28 to Ser-40. Polynucleotides 
encoding said polypeptides arc also provided. 

The tissue distribution in ovarian tissues indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for treating and diagnosing cancer, 
e.g., ovarian cancer. Moreover, the expression within cellular sources marked by 
proliferating cells indicates this protein may play a rote in the regulation of cellular 
division, and may show utility in the diagnosis, treatment, and/or prevention of 
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developmental diseases and disorders, including cancer, and other proliferative 
conditions. Representative uses are described in the "Hyperproliferative Disorders" 
and "Regeneration" sections below and elsewhere herein. Briefly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 
formation. 

Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Because of 
potential roles in proliferation and differentiation, this gene product may have 
applications in the adult for tissue regeneration and the treatment of cancers. It may 
also act as amorphogen to control cell and tissue type specification. Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and is useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. The protein is useful in 
modulating the immune response to aberrant polypeptides, as may exist in 
proliferating and cancerous cells and tissues. The protein can also be used to gain new 
insight into the regulation of cellular growth and proliferation. Furthermore, the 
protein may also be used to determine biological activity, to raise antibodies, as tissue 
markers, to isolate cognate ligands or receptors, to identify agents that modulate their 
interactions, in addition to its use as a nutritional supplement. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 107 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
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formula of a-b, where a is any integer between I to 816 of SEQ ID NO: 107, b is an 
integer of 15 to 830, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 107, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 98 

This gene is expressed primarily in macrophages and breast cancer tissue and 
to a lesser extent in osteoblasts and smooth muscle. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differcntiaj identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but. are 
not limited to, immune system dysfunction; inflammation; breast cancer; cancer; * 
osteoporosis; osteopetrosis; peristaltic disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissues) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune and skeletal 
systems, expression of this gene at significantly higher or lower levels is routinely 
detected in certain tissues or cell types (e.g., cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred polypeptides of the present invention comprise immunogenic 
epitopes shown in SEQ ID NO: 221 as residues: Giu-16 to Ala-40. Polynucleotides 
encoding said polypeptides are also provided. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and/or treatment of a variety of 
disorders. Expression in macrophages and other hematopoietic cell types indicates 
that this gene product is involved in the regulation of hematopoietic cell survival, 
proliferation, differentiation, or activation. It is involved in the control of such 
processes as immune surveillance, antigen presentation, T ceil activation, cytokine 
release, and inflammation. Expression in breast cancer tissue may possibly correlate 
with the diagnosis and differentiation of cancerous tissue from normal breast tissue. 
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Expression in osteoblasts and osteoclasts may implicate this gene protect in the 
process of bone turnover, and target it as a likely candidate for the treatment of 
osteoporosis and/or osteopetrosis. Finally, expressio in smooth muscle may indicate 
an involvement in the normal function of numerous internal organs and in the 
function of the digestive system. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 108 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention arc one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, whereais any integer between 1 to 1287 of SEQ ID NO: 108. b is an 
integer of 1 5 to 1 30 1 , where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 108, and where b is greater than or equal to a + 14. 
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Table 1 summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as **NT SEQ ID NO:X" was assembled 
from partially homologous ("overlapping") sequences obtained from the "cDNA 
clone ID" identified in Table 1 and, in some cases, from additional related DNA 
clones, '"he overlapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overlapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:X. 

The cDNA Clone TD was deposited on the date and given the corresponding 
deposit number listed io **ATCC Deposit No:Z and Date." Some of the deposits 
contain multiple different clones corresponding to the same gene. "Vector" refers to 
the type of vector contained in the cDNA Clone ID. * 

'Total NT Seq." refers to the total number of nucleotides in the contig 
identified by "Gene No." The deposited clone may contain all or most of these 
sequences, reflected by the nucleotide position indicated as "5* NT of Clone Seq." 
and the "3' NT of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID 
NO:X of the putative start codon (methionine) is identified as "5* NT of Start Codon " 
Similarly , the nucleotide position of SEQ ID NO:X of the predicted signal sequence 
is identified as "5' NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is 
identified as "A A SEQ ID NO:Y," although other reading frames can also be easily 
translated using known molecular biology techniques. The polypeptides produced by 
these alternative open reading frames are specifically contemplated by the present 
invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The 
predicted first amino acid position of SEQ ID NO: Y of the secreted portion is 
identified as "Predicted First AA of Secreted Portion." Finally, the amino acid 
position of SEQ ID NO: Y of the last amino acid in the open reading frame is 
identified as "Last AA of ORE" 

SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently accurate and 
otherwise suitable for a variety of uses well known in the art and described further 
below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization 
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probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the 
cDNA contained in the deposited clone. These probes will also hybridize to nucleic 
acid molecules in biological samples, thereby enabling a variety of forensic and 
diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID 
NO: Y may be used to generate antibodies which bind specifically to the secreted 
proteins encoded by the cDNA clones identified in Table 1. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino 
acid sequence. In these cases, the predicted amino acid sequence diverges from the - 
actual amino acid sequence, even though the generated DNA sequence may be greater 
than 99.9% identical to the actual DNA sequence (for example, one base insertion or 
deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 
sequence or the amino acid sequence, the present invention provides not only the 
generated nucleotide sequence identified as SEQ ID NO:X and the predicted 
translated amino acid sequence identified as SEQ ID NO:Y, but also a sample of 
plasmid DNA containing a human cDNA of the invention deposited with the ATCC, 
as set forth in Table 1. The nucleotide sequence of each deposited clone can readily 
be determined by sequencing the deposited clone in accordance with known methods. 
The predicted amino acid sequence can then be verified from such deposits. 
Moreover, the amino acid sequence of the protein encoded by a particular clone can 
also be directly determined by peptide sequencing or by expressing the protein in a 
suitable host cell containing the deposiied human cDNA, collecting the protein, and 
determining its sequence. 

The present invention also relates to the genes corresponding to SEQ ID 
NO:X, SEQ ID NO: Y, or the deposited clone. The corresponding gene can be 
isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include preparing probes or primers from the disclosed 
sequence and identifying or amplifying the corresponding gene from appropriate 
sources of genomic material. 
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Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from 
the sequences provided herein and screening a suitable nucleic acid source for the 
desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly 
produced polypeptides, synthetically produced polypeptides, or polypeptides 
produced by a combination of these methods. Means for preparing such polypeptides 
are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). ' 
It is often advantageous lo include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantly produced 
version of a polypeptide, including the secreted polypeptide, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67:31-40 
(1988). Polypeptides of the invention also can be purified from natural or 
recombinant sources using antibodies of the invention raised against the secreted 
protein in methods which are well known in the art. 

Signal Seqmrnyfs 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of 
McGeoeh, Virus Res. 3:271-286 (1985), uses the information from a short N-terminal 
charged region and a subsequent uncharged region of the complete (uncleaved) 
protein. The method of von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the 
information from the residues surrounding the cleavage site, typically residues -13 to 
+2, where +1 indicates the amino terminus of the secreted protein. The accuracy of 
predicting the cleavage points of known mammalian secretory proteins for each of 
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these methods is in the range of 75-80%. (von Hcinje, supra.) However, the two 
methods do not always produce the same predicted cleavage point(s) for a given 
protein. 

In the present case, the deduced amino acid sequence of the secreted 
polypeptide was analyzed by a computer program called SignalP (Henrik Nielsen et 
al.. Protein Engineering 10:1-6 (1997)), which predicts the cellular location of a 
protein based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and vonHeinje are incorporated. The analysis 
of the amino acid sequences of the secreted proteins described herein by this program 
provided the results shown in Table 1. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO: Y which have an N-terminus beginning within 5 residues (i.e., 
+ or - 5 residues) of the predicted cleavage poinL Similarly, it is also recognized that 
in some cases, cleavage of the signal sequence from a secreted protein is not entirely 
uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present 
invention. 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the 
naturally occurring signal sequence may be further upstream from the predicted signal 
sequence. However, it is likely that the predicted signal sequence will be capable of 
directing the secreted protein to the ER. These polypeptides, and the polynucleotides 
encoding such polypeptides, are contemplated by the present invention. 

Polynucle otide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential 
properties thereof. Generally, variants are overall closely similar, and, in many 
regions, identical to the polynucleotide or polypeptide of the present invention. 

By a polynucleotide having a nucleotide sequence at least, for example, 95% 
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"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
10 sequence except that the polynucleotide sequence may include up to five point 

mutations per each 100 nucleotides of the reference nucleotide sequence encoding the 
5 polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence 
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides 
15 in tne reference sequence may be deleted or substituted with another nucleotide, or a 

number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. The query sequence may be an entire 
20 10 ^encc shown inTable I , the ORF (open reading frame), or any fragement specified 

as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 
computer programs. A preferred method for determing the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using 
'.he FASTD3 computer program based on the algorithm of BrutJag et al. (Comp. App. 
Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences 
are both DNA sequences. An RNA sequence can he compared by convening ITs to 
35 T*s. The result of said global sequence alignment is in percent identity. Preferred 

parameters used in a FASTDB alignment of DNA sequences to calculate percent 
identiy are. Matrix==Unitary, k-tuple=4, Mismatch Penalty^!, Joining Penalty=30, 

Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penaltv 
AO ' 

25 0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever is 

shorter. 

If the subject sequence is shorter than the query sequence because of 5" or 3' 
45 deletions, not because of internal deletions, a manual correction must be made to the 

results. This is because the FASTDB program does not account for 5 ? and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5* or 3' ends, relative to the the query sequence, the 
percent identity is corrected by calculating the number of bases of the query sequence 
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that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent 
of the total bases of the query sequence. Whether a nucleotide is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only bases outside the 
5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, 
which arc not matched/aligned with the query sequence, are calculated for the 
purposes of manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the 
subject sequence and therefore, the FASTDB alignment does not show a 
maiched/alignement of the first 10 bases at 5* end. The 10 unpaired bases represent 
10% of the sequence (number of bases at the 5* and 3' ends not matched/total number 
of bases in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 bases were perfectly 
matched the final percent identity would be 90%. In another example, a 90 base 
subject sequence is compared with a 100 base query sequence. This time the 
deletions are internal deletions so that there are no bases on the 5' or 3* of the subject 
sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only bases 5* 
and 3' of the subject sequence which are not matched/aligned with the query sequnce 
are manually corrected for. No other manual corrections are to made for the purposes 
of the present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the query amino acid sequence. In other 
words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
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These alterations of the reference sequence may occur at the amino or carboxy 
terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in 
Table 1 or to the amino acid sequence encoded by deposited DNA clone can be 
determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of ' 
Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the 
query and subject sequences are either both nucleotide sequences or both amino acid 
sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, 
k-tuple=2. Mismatch Penalty^ 1 , Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score= I, Window Size=sequence length, Gap Penalty=5, Gap Size 
Pena!:y=C.C5, Window Size=500 or the length of the subject amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be 
made to the results. This is becuase the FASTDB program does not account for N- 
and C-terminal truncations of the subject sequence when calculating global percent 
identity. For subject sequences truncated at the N- and C-termini. relative to the the 
query sequence, the percent identity is corrected by calculating the number of residues 
of the query sequence that arc N- and C-terminal of the subject sequence, which are 
not matched/aligned with a corresponding subject residue, as a percent of the total 
bases of the query sequence. Whether a residue is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score 
is what is used for the purposes of the present invention. Only residues to the N- and 
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C-termini of the subject sequence, which are not matched/aligned with the query 
sequence, are considered for the purposes of manually adjusting the percent identity 
10 scor e- That is, only query residue positions outside the farthest N- and C-terminai 

residues of the subject sequence. 
5 For example, a 90 amino acid residue subject sequence is aligned with a 100 

residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
show a matching/alignment of the first 10 residues at the N-terminus. The 10 
unpaired residues represent lOJb of the sequence (number of residues at the N- and C- 
20 10 termini n °* matched/total number of residues in the query sequence) so 1 0% is 

subtracted from the percent identity score calculated by the FASTDB program. If the ' 
remaining 90 residues were perfectly matched the final percent identity would be 
90%. In another example, a 90 residue subject sequence is compared with a 100 
residue query sequence. This time the deletions are internal deletions so there are no 
15 residues at the N- or C-termini of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only residue positions outside the N- and C-terminal 
ends of the subject sequence, as displayed in the FASTDB alignment, which are not 
matched/aligned with the query sequnce are manually corrected for. No other manual 
20 corrections are to made for the purposes of the present invention. 
35 The va riants may contain alterations in the coding regions, non-coding 

regions, or both. Especially preferred are polynucleotide variants containing 
alterations which produce silent substitutions, additions, or deletions, but do not alter 
the properties or activities of the encoded polypeptide. Nucleotide variants produced 
25 by silent substitutions due to the degeneracy of the genetic code are preferred. 

Moreover, variants in which 5-10, 1-5, or 1 -2 amino acids are substituted, deleted, or 
added in any combination are also preferred. Polynucleotide variants can be produced 
45 for a variel y of masons, e.g., to optimize codon expression for a particular host 

(change codons in the human mRNA to those preferred by a bacteria] host such as E. 
30 coli). 

Naturally occurring variants are called "allelic variants." and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
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organism. (Genes n, Lewin, B., ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-natural ly occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA 
technology, variants may be generated to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-terminus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et al M J. Biol. Chem. 268: 2984-2988 
(1993), reported variant KGF proteins having heparin binding activity even after 
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from 
the carboxy terminus of this protein. (Dobeli et ah, J. Biotechnology 7: 199-216 
(1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayte arid 
coworkers (J. Bioi. Chem 268:22105-2211 1 (1993)) conducted extensive mutational 
ar.a!ys:s of human cytokine IL-1 a. They used random mutagenesis to generate over 
3,500 individual IL-1 a mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every 
possible amino acid position. The investigators found that "Most of the molecule 
could be altered with little effect on either [binding or biological activity]." (See, 
Abstract.) In fact, only 23 unique amino acid sequences, out of more than 3,500 
nucleotide sequences examined, produced a protein that significantly differed in 
activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of 
a deletion variant to induce and/or to bind antibodies which recognize the secreted 
form will likely be retained when less than the majority of the residues of the secreted 
form are removed from the N-terminus or C-terminus. Whether a particular 
polypeptide lacking N- or C-terminal residues of a protein retains such immunogenic 
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activities can readily be determined by routine methods described herein and 
otherwise known in the art 
10 Thus, the invention further includes polypeptide variants which show 

substantial biological activity. Such variants include deletions, insertions, 
5 inversions, repeats, and substitutions selected according to general rules known in the 
an so as have little effect on activity. For example, guidance concerning how to make 
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et aJ., 
Science 247: 1306-13 10 (1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
1 5 positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified wliile still maintaining biological activity of the 
protein. 

The second strategy uses genetic engineering to introduce amino acid changes 
at specific positions of a cloned gene to identify regions critical for protein function. 
20 For example, site directed mutagenesis or alantne-scanning mutagenesis (introduction 
35 of single alanine mutations at every residue in the molecule) can be used. 

(Cunningham and Wells, Science 244:1081-1085 (1989).) The resulting mutant 
molecules can then be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 
protein. For example, most buried (within the tertiary structure of the protein) amino 
45 acid rcsidues require nonpolar side chains, whereas few features of surface side chains 

are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val ; Leu and 
He; replacement of the hydroxyl residues Ser and Thr; replacement of the acidic 
residues Asp and Glu; replacement of the amide residues Asn and Gin. replacement of 
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the basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, 
and Tip, and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 



Besides conservative amino acid substitution, variants of the present invention 
5 include (i) substitutions with one or more of the non-conserved amino acid residues, 
where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substitucnt group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 
20 10 exara P Ie * polyethylene glycol), or (i v) fusion of the polypeptide with additional amino 

acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of 
1 5 charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of 
pharmaceutical formulations both reduces activity and increases clearance due to the 
aggregate's immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 
(1967); Robbins et a!., Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. 
20 Therapeutic Drug Carrier Systems 10:307-377 (1993).) 
35 A further embodiment of the invention relates to a polypeptide which 

comprises the amino acid sequence of the present invention having an amino acid 
sequence which contains at least one amino acid substitution, but not more than 50 
amino acid substitutions, even more preferably, not more than 40 amino acid 
25 substitutions, still more preferably, not more than 30 amino acid substitutions, and 
still even more preferably, not more than 20 amino acid substitutions. Of course, in 
order of ever- increasing preference, it is highly preferable for a polypeptide to have 
45 m arnino acid sequence which comprises the amino acid sequence of the present 

invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
30 amino acid substitutions. In specific embodiments, the number of additions, 
substitutions, and/or deletions in the amino acid sequence of the present invention or 
fragments thereof (e.g., the mature form and/or other fragments described herein), is 
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1-5, 5-10, 5-25, 5-50, 10-50 or 50-150, conservative amino acid substitutions are 
preferable. 



Polynucleotide and Polype ptide Fragments 

5 In the present invention, a "polynucleotide fragment" refers to a short 

polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 
nt, and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt 
20 10 in Ien S th -" for example, is intended to include 20 or more contiguous bases from the 

cDNA sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and 
primers as discussed herein. Of course, larger fragments (e.g., 50, 1 50, 500, 600, 
25 2000 nucleotides) are preferred. 

15 Moreover, representative examples of polynucleotide fragments of the 

invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 
450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 901- 
950,951-1000, 1001-1050, 3051-1 100, 1 101-1150, 1151-1200, 1201-1250, 1251- 
20 1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
35 ^50, 1651-1700, 1701-1750, 1751-1800, 180M850, 1851-1900, 1901-1950, 1951- 

2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the deposited 
clone. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3 , 2, or i ) nucleotides, at either terminus or at both termini. 
25 Preferably, these fragments encode a polypeptide which has biological activity. More 
preferably, these polynucleotides can be used as probes or primers as discussed 
herein. 

45 In the P re senl invention, a "polypeptide fragment" refers to a short amino acid 

sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the 
30 deposited clone. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 



50 



55 



15 



WO 00/06698 PCTVUS99/17130 
5 274 

invention, include, for example, fragments from about amino acid number 1-20, 21- 
40, 41-60, 61-80. 81-100, 102-120, 121-140. 141-160, or 161 to the end of the coding 
10 region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 

100. 110, 120, 130, 140, or 150 amino acids in length. In this context "about" 
5 includes the particularly recited ranges, larger or smaller by several (5, 4, 3. 2, or 1) 
anuno acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having ^continuous series of deleted residues from the amino or the 
20 10 carbox y terminus, or both. For example, any number of amino acids, ranging from. 1 - 

60, can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1 -30, can be 
deleted from the carboxy terminus of the secreted protein or mature form. 
Furthermore, any combination of the above amino and carboxy terminus deletions are 
15 preferred. Similarly, polynucleotide fragments encoding these polypeptide fragments 
are also preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and 
alpha-helix forming regions, beta sheet and bcta-shect-forming regions, turn and turn- 
20 forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
35 regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 

surface-forming regions, substrate binding region, and high antigenic index regions. 
Polypeptide fragments of SEQ ID NO. Y falling within conserved domains are 
specifically contemplated by the present inveniion. Moreover, polynucleotide 
25 fragments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically 
active fragments are those exhibiting activity similar, but not necessarily identical, to 
45 311 aclivit y of lhe polypeptide of the present invention. The biological activity of the 

fragments may include an improved desired activity, or a decreased undesirable 
30 activity. 

50 Epitopes A 4ntihofiiPs: 
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In the present invention, "epitopes" refer to polypeptide fragments having 
antigenic or immunogenic activity in an animal, especially in a human. A preferred 
10 embodiment of the present invention relates to a polypeptide fragment comprising an 

epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
5 molecule to which an antibody can bind is defined as an "antigenic epitope." In 
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et ah, Proc. Natl. Acad. Sci. USA 
81:3998- 4002(1983).) 

Fragments which function as epitopes may be produced by any conventional 
20 10 means ' (See > e *S- Houghten. R. A., Proc. Nail. Acad. Sci. USA 82:5131-5135 (1985) 

further described in U.S. Patent No. 4,63 1,21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 
least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See. for instance, Wilson 
et ai., Cell 37:767-778 (1984); Smcliffe, J. G. et al., Science 219:660-666 (1983);) 

Similarly, immunogenic epitopes can be used to induce antibodies according 
to methods well known in the art. (See, for instance, Smcliffe et al., supra; Wilson et 
al.. supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. el 
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes 
35 tne secreted protein. The immunogenic epitopes may be presented together with a 

carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, 
if it is long enough (at least about 25 amino acids), without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to 
be sufficient to raise antibodies capable of binding to, at the very least, linear epitopes 
in a denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
45 mcant 10 inclu de intact molecules as well as antibody fragments (such as, for 

example. Fab and F(ab')2 fragments) which are capable of specifically binding to 
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 
more rapidly from the circulation, and may have less non-specific tissue binding than 
an intact antibody. (Wahl et aL, J. Nucl. Med. 24:316-325 (1983).) Thus, these 
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fragments are preferred, as well as the products of a FAB or other immunoglobulin 
expression library. Moreover, antibodies of the present invention include chimeric, 
single chain, and humanized antibodies. 



Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion 
proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second 
protein by binding to the polypeptide. Moreover, because secreted proteins target 
cellular locations based on trafficking signals, the polypeptides of the present 
invention can be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present 
invention include not only heterologous signal sequences, but also other heterologous 
functional regions. The fusion does not necessarily need to be direct, but may occur 
through linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics 
cf the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence during purification from the host cell 
or subsequent handling and storage. Also, peptide moieties may be added to the 
polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties to facilitate handling 
of polypeptides are familiar and routine techniques in the art. 

Moreover polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains 
of mammalian immunoglobulins. (EP A 394,827; Traunecker et aL, Nature 33 1 :84- 
86 (1988).) Fusion proteins having disulfide-Jinked dimeric structures (due to the 
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IgG) can also be more efficient in binding and neutralizing other molecules, than the 
monomelic secreted protein or protein fragment alone. (Fountoulakis ct ah, J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternauvely, 
deleting the Fc part after die fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if 
the fusion protein is used as an antigen for immunizations. In drug discovery, for ' 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, 
D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. 
In preferred embodiments, the marker amino acid sequence is a hexa-histidine 
peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 9 1 3 1 1 ), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989). for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al.. Cell 37:767 
(1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

Vectors. Host Cells, and Pr otein PrnHnrrinn 

The present invention also relates to vectors containing the polynucleotide of 
the present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
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vector Retroviral vectors may be replication competent or replication defective. In 
the latter case, viral propagation generally will occur only in complementing host 
cells. 



The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
and then transduced into host cells. 

The polynucleotide^ insert should be operatively linked to an appropriate 
20 10 P romoter > such ^ the phage lambda PL promoter, the E. coli lac, tip, phoA and tac . 

promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to ' 
name a few. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (TJAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolatc reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin 
35 resistance genes for culturing in E. coii and other bacteria. Representative examples 

of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture 
mediums and conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQEoO and pQE- 
45 9 ' avaUablc from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 

pNH16a, pNH18A. pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, P KK233-3, pDR540, P RIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
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and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, eiectroporauon, transduction, infection, or other methods. Such methods 
arc described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1986). It is specifically contemplated that the 
polypeptides of the present invention may in fact be expressed by a host cell lacking a 
recombinant vector. ; 

A polypeptide of this invention can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphoceliulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can 
also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 
insect, and mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be glycosylated 
or may be non-glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as a result of host- 
mediated processes. Thus, it is well known in the art that the N-terminal methionine 
encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal 
methionine on most proteins also is efficiently removed in most prokaryotcs, for some 
proteins, this prokaryotic removal process is inefficient, depending on the nature of 
the amino acid to which the N-terminal methionine is covalently linked. 
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in addition 10 encompassing host cells containing the vector constructs 
discussed herein, the invention also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 
been engineered to delete or replace endogenous genetic material (e.g.. coding 
sequence), and/or to include genetic material (e.g., heterologous polynucleotide 
sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 
regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences 
via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 
1997, International Publication No. WO 96/29411, published September 26, 1996; 
International Publication No. WO 94/12650, published August 4, 1994; Roller et ah, 
Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et ah. Nature 342:435- 
438 (1989), the disclosures of each of which are incorporated by reference in their 
entireties). 

Uses of the PolinnirWiAH, 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present 
invention can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more dian one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of 
somatic cell hybrids containing individual human chromosomes. Only those hybrids 
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containing the human gene corresponding to the SEQ ID NO:X will yield an 
amplified fragment 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalizalion of the polynucleotides 
can be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow-sorted chromosomes, and preselection by hybridization to construct 
chromosome specific-cDN^ libraries. 

Precise chromosomal location of the polynucleotides can also be achieved - 
using fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. 
This technique uses polynucleotides as short as 500 or 600 bases; however, 
polynucleotides 2,000-4,000 bp are preferred. For a review of this technique, see 
Verma et al., "Human Chromosomes: a Manual of Basic Techniques " Pergamon 
Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, 
the physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation 
of a particular disease. (Disease mapping data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library) .) Assuming I megabase mapping resolution and 
one gene per 20 kb, a cDNA precisely localized to a chromosomal region associated 
with the disease could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
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translocations, are examined in chromosome spreads or by PGR. If no structural 
alterations exist, the presence of point mutations are ascertained. Mutations observed 
in some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide 
and the corresponding gene from several normal individuals is required to distinguish 
the mutation from a polymorphism. If a new polymorphism is identified, this 
polymorphic polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to?unaffected individuals can be assessed using 
polynucleotides of the presenr invention. Any of these alterations (altered expression; 
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can he used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, 
preferred polynucleotides are usually 20 to 40 bases in length and complementary to 
either the region of the gene involved in transcription (triple helix - see Lee et al., 
Nad. Acids Res. 6:3073 (1979); Cooney et al.. Science 241:456 (1988); and Dervan 
et ah, Science 25 1 : 1 360 ( 1 99 1 )) or to the mRNA itself (antisense - Okano, J, 
Ncurochem. 56:560 (1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton. FL (1988).) Triple helix formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA 
hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques are effective in model systems, and the information disclosed herein can 
he used to design antisense or triple helix polynucleotides in an effort to treat disease. 

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
geue, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that, was not present in the host genome, 
thereby producing a new trait in the host cell. 
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The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. 
In this technique, an individual s genomic DN A is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield unique bands for 
identifying personnel. This method does not suffer from the current limitations of 
"Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The polynucleotides of the present invention can be used as additional DNA 
markers for RFLP. # 

The polynucleotides of the present invention can also be used as an alternative 
to RFLP, by detennining the actual base-by-base DNA sequence of selected portions 
of an individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using 
this technique, individuals can be identified because each individual will have a 
unique set of DNA sequences. Once an unique ID database is established for an 
individual, positive identification of that individual, living or dead, can be made from 
extremely small tissue samples. 

Forensic biology also benefits from using DNA-based identification 
techniques as disclosed herein. DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, 
etc., can be amplified using PCR. In one prior art technique, gene sequences 
amplified from polymorphic loci, such as DQa class II HLA gene, are used in forensic 
biology to identify individuals. (Erlich, R, PCR Technology, Freeman and Co. 
(1992).) Once these specific polymorphic loci arc amplified, they are digested with 
one or more restriction enzymes, yielding an identifying set of bands on a Southern 
blot probed with DNA corresponding to the DQa class II HLA gene. Similarly, 
polynucleotides of the present invention can be used as polymorphic markers for 
forensic purposes. 

There is also a need for reagents capable of identifying the source of a 
particular tissue. Such need arises, for example, in forensics when presented with 
tissue of unknown origin. Appropriate reagents can comprise, for example, DNA 
probes or primers specific to particular tissue prepared from the sequences of the 



WO 00/06698 PCT/US99/17130 



present invention. Panels of such reagents can identify tissue by species and/or by 
organ type. In a similar fashion, these reagents can be used to screen tissue cultures 
10 for contamination. 

In the very least, the polynucleotides of the present invention can be used as 
5 molecular weight markers on Southern gels, as diagnostic probes for the presence of a 

specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
15 . 

in tne process of discovering novel polynucleotides, for selecting and making 

oligomers for attachment to a "gene chip" or other support, to raise anti-DN A 

antibodies using DNA immunization techniques, and as an antigen to elicit an 

2Q 10 immune response. 

Uses of the Polype ptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 
15 A polypeptide of the present invention can be used to assay protein levels in a 

biological sample using antibody-based techniques. For example, protein expression 
in tissues can be studied with classical immunohistological methods. (Jalkanen, M., 
et ah, J. Cell. Biol. 101 :976-985 (1985); Jalkanen, M., et al, J. Cell . Biol. 105:3087- 
3096 (1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
35 (ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 

in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and 
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
40 25 biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
45 imaging of protein include those detectable by X-radiography. NMR or ESR. For X- 

radiography, suitable labels include radioisotopes such as barium or cesium, which 
30 emit delectable radiation but are not overtly harmful to the subject. Suitable markers 
for NMR and ESR include those with a detectable characteristic spin, such as 
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deuterium, which may be incorporated into the antibody by labeling of nutrients for 
the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 
1 12In, 99mTc), a radio-opaque substance, or a material detectable by nuclear 
magnetic resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the an that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In the case of a radioisotope moiety, for a 
human subject, the quantity of radioactivity injected will normally range from about 5 ' 
to 20 millicurics of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. 
In vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharrnacokinetics 
of Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: 
The Radiochemical Detection of Cancer, S.W. Burchiel and B . A. Rhodes, eds., 
Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which 
involves (a) assaying the expression of a polypeptide of the present invention in cells 
or body fluid of an individual; (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free Iigand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 
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Similarly, antibodies directed to a polypeptide of the present invention can 
also be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, 
such as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art Polypeptides can also 
be used to raise antibodies/which in turn are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, * 
the polypeptides of the present invention can be used to test the following biological 
activities. 



Biologica l Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Vivify 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, 
red blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) 
cells from pluripoient stem cells. The etiology of these immune deficiencies or 
disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, a polynucleotide 
or polypeptide of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. 
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A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A 
polypeptide or polynucleotide of the present invention could be used to increase 
differentiation and proliferation of hematopoietic cells, including the piuripotent stem 
cells, in an effort to treat those disorders associated with a decrease in certain (or 
many) types hematopoietic cells. Examples of immunologic deficiency syndromes 
include, but are not limited to: blood protein disorders (e.g. agammaglobulinemia, 
dysganunaglobulinernia), ataxia telangiectasia, common variable immunodeficiency, 
Digeorge Syndrome, HIV#infection, HTLV-BLV infection, leukocyte adhesion 
deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that 
can decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of 
the host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the 
present invention include-; but are not limited to: Addison's Disease, hemolytic 
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anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 
encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
'0 Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 

Pemphigus, Poiyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, 
5 Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary 
Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and 
autoimmune inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by a polypeptide 
20 1 0 or polynucleotide of the present invention. Moreover, these molecules can be used to 

treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood group 
incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD. but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of 
20 T-cells, may be an effective therapy in preventing organ rejection or GVHD. 
35 Similarly, a polypeptide or polynucleotide of the present invention may also 

be used to modulate inflammation. For example, the polypeptide or polynucleotide 
may inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- 
reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
45 rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 

disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
30 IL-1.) 

50 Hvpernrn|ifP fative T)isorri m 
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A polypeptide or polynucleotide can be used to treat or detect - 
hyperoroliferative disorders, including neoplasms. A polypeptide or polynucleotide 
10 of ™e present invention may inhibit the proliferation of the disorder through direct or 

indirect interactions. Alternatively, a polypeptide or polynucleotide of the present 
5 invention may proliferate other cells which can inhibit the hyperproliferative disorder. 
For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hypeiproliferative disorders can be treated. 
This immune response ma^ be increased by either enhancing an existing immune 
20 10 response, or by initiating a new immune response. Alternatively, decreasing an 

immune response may also be a method of treating hyperproliferative disorders, such 
as a chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
a polynucleotide or polypeptide of the present invention. Examples of such 
35 hyperproliferative disorders include, but are not limited to: 

hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
- sarcoidosis, Sezary Syndrome, Waldenstrom s Macroglobulinemia, Gaucher's 

Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia 
40 ' 
25 located in an organ system listed above. 
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Infectious Oigfrflff? 

45 A Polypeptide or polynucleotide of the present invention can be used to treat 

or detect infectious agents. For example, by increasing the immune response, 
30 particularly increasing the proliferation and differentiation of B and/ur T cells, 

infectious diseases may be treated. The immune response may be increased by either 

50 * enhancing an existing immune response, or by initiating a new immune response. 
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Alternatively, the polypeptide or polynucleotide of the present invention may also 
directly inhibit the infectious agent, without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Aiterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Haviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as, Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), |£ononegavinis (e.g., Paramyxoviridae. Morbilli virus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, . 
Picomaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Rctroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiollitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenppx , hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold. Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and 
that can be treated or detected by a polynucleotide or polypeptide of the present 
invention include, but not limited to, the following Gram-Negative and Gram-positive 
bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceac (e.g., Anthrax, Clostridium), 
Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatocycoses, 
Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, 
Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmaiales, Neisseriaccae 
(e.g., Acinetobacter, Gonorrhea, MenigococcaJ), Pasteureliacea Infections (e.g., 
AcUnobacillus. Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, 
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Chlamydiaceae, Syphilis, and Staphylococcal. These bacterial or fungal families can 
cause the following diseases or symptoms, including, but not limited to: bacteremia, 
endocarditis, eye infections (conjunctivitis, tuberculosis, uveitis), gingivitis, 
opportunistic infections (e.g., AIDS related infections), paronychia, prosthesis-related 
infections, Reiter's Disease, respiratory tract infections, such as Whooping Cough or 
Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, 
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 
tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin 
diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound * 
infections. A polypeptide or polynucleotide of the present invention can be used to 
treat or detect any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, 
Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and 
Tr.choinonas. These parasites can cause a variety of diseases or symptoms, including, 
but not limited to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., 
dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS 
related), Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or 
polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a 
polynucleotide of the present invention, and returning the engineered cells to the 
patient (ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present 
invention can be used as an antigen in a vaccine to raise an immune response against 
infectious disease. 

Regeneration 
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A polynucleotide or polypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
Science 276:59-87 ( 1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, bums, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestgae, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, - 
and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration 
occurs without or decreased scarring. Regeneration also may include angiogenesis. 

Moreover, a polynucleotide or polypeptide of the present invention may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide of the present invention could also be used 
prophylactically in an effort to avoid damage. Specific diseases that could be treated 
include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. A 
further example of tissue regeneration of non-healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinsons disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 
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Chemotavk 

A polynucleotide or polypeptide of the present invention may have 
chemotaxis activity. A chemotaxic molecule attracts or mobilizes cells (e.g., 
monocytes, fibroblasts, neutrophils, T-cells. mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, or 
site of hypcrproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be used 
to treat inflammation, infection, hyperproliferativc disorders, or any immune system 
disorder -by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of 
the present invention can also atLract fibroblasts, which can be used to treat wounds. 

It is also contemplated that a polynucleotide or polypeptide of the present 
invention may inhibit chemotactic activity. These molecules could also be used to 
treat disorders. Thus, a polynucleotide or polypeptide of the present invention could 
be used as an inhibitor of chemotaxis. 

Binding Ar«yjjj Y 

A polypcpude of the present invention may be used to screen for molecules 
that bind to the polypeptide or for molecules to which the polypeptide binds. The 
binding of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g.. receptors),or 
small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et ah. Current Protocols in Immunology 
l(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 



WO 00/06698 



PCT/US99/I7130 



15 



10 

20 



of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 
10 Preferably, the screening for these molecules involves producing appropriate 

cells which express the polypeptide, either as a secreted protein or on the cell 
5 membrane. ?referred cells include cells from mammals, yeast, Drosophila, or E. colL 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially 
containing the molecule to observe binding, stimulation, or inhibition of activity of 
either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the 
polypeptide, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to the polypeptide. 
25 Alternatively, the assay can be carried out using cell-free preparations, 

15 pul ypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the poiypeptide/molecuie activity or binding to a 
standard. 

20 Preferably, an ELISA assay can measure polypeptide level or activity in a 

35 sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 

antibody can measure polypeptide level or activity by either binding, direcdy or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to bring 
about a particular result in a patient (e.g., blood vessel growth) by activating or 
inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which 
may inhibit or enhance the production of the polypeptide from suitably manipulated 
cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
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candidate binding compound with a polypeptide of the invention; and (b) determining 
if binding has occurred. Moreover, the invention includes a method of identifying 
10 agonists/antagonists comprising the steps of: (a) incubating a candidate compound 

with a polypeptide of the invention, (b) assaying a biological activity , and (b) 
5 determining if a biological activity of the polypeptide has been altered. 

15 .Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 
10 discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
25 surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 

15 used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
caricadic rhythms, depression (including depressive disorders), tendency for violence, 
tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-Iike 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other 
cognitive qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content. lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 
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Other PrefwrJ Emhodimon^ 

Other preferred embodiments of the claimed invention include an isolated 
30 nucleic acid molecule comprising a nucleotide sequence which is at least 95% 

identical to a sequence of at least about 50 contiguous nucleotides in the nucleotide 
50 sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1. 
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Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5* Nucleotide of 
the Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table i. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Start Codon and ending ;with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO;X in Table 1 . 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5* 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the 
nucleotide at about the position of the 3' Nucleotide of the Clone Sequence as defined 
for SEQ ID NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of 
the First Amino Acid of the Signal Peptide and ending with the nucleotide at about 
the position of the 3* Nucleotide of the Clone Sequence as defined for SEQ ID NO:X 
in Table 1. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence of SEQ ID NO:X. 
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Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic 
acid molecule which hybridizes does not hybridize under stringent hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence consisting of only 
A residues or of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of 
at least 50 contiguous nucleotides is included in the nucleotide sequence of the 
complete o^er. reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to sequence of at 
least 500 contiguous nucleotides in the nucleotide sequence encoded by said human 
cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence encoded by said human cDN A clone. 

A further preferred embodiment is a method for delecting in a biological 
sample a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
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wherein X is any integer as defined in Table 1; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1 ; which method comprises a step of comparing a nucleotide sequence of at least one 
nucleic acid molecule in said sample with a sequence selected from said group and 
determining whether the sequence of said nucleic acid molecule in said sample is at 
least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence 
selected from said group. Similarly, also preferred is the above method wherein said 
step of comparing sequences is performed by comparing the nucleotide sequence 
determined from a nucleic acid molecule in said sample with said sequence selected 
from said group. The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

A further preferred embodiment is a method for identifying the species, tissue 
or cell type of a biological sample which method comprises a step of detecting nucleic 
acid molecules in said sample, if any, comprising a nucleotide sequence that is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
wherein X is any integer as defined in Table 1; and a nucleotide sequence encoded by 
a human cDN A clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

The method for identifying the species, tissue or cell type of a biological 
sample can comprise a step of detecting nucleic acid molecules comprising a 
nucleotide sequence in a panel of at least two nucleotide sequences, wherein at least 
one sequence in said panel is at least 95% identical to a sequence of at least 50 
contiguous nucleotides in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Tabic 1, which method comprises a step of detecting in a 
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biological sample obtained from said subject nucleic acid molecules, if any, 
comprising a nucleotide sequence that is at least 95% identical to a sequence of at 
least 50 contiguous nucleotides in a sequence selected from the group consisting of: a 
nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1 ; 
and a nucleotide sequence encoded by a human cDN A clone identified by a cDNA 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit 
Number shown for said cDNA clone in Table 1. 

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at 
least two nucleotide sequences, wherein at least one sequence in said panel is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise 
a panel of at least two nucleotide sequences, wherein at least one sequence in said 
panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a 
sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 
NO:X wherein X is any integer as defined in Table 1; and a nucleotide sequence 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Tabic 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table t . The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1 . 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of 
positions beginning with the residue at about the position of the First Amino Acid of 
the Secreted Portion and ending with the residue at about the Last Amino Acid of the 
Open Reading Frame as set forth for SEQ ID NO:Y in Table 1. 
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Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of ai least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to a sequence of at least about 100 contiguous amino 
acids in the amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to the complete amino acid sequence of SEQ ID 
NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 90% identical to a sequence of at least about 10 contiguous amino 
acids in the complete amino acid sequence of a secreted protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted 
protein encoded by a human cDNA clone identified by a cDN A Clone identifier in 
Tab!e 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDN A 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in 
the amino acid sequence of the secreted portion of the protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Tabic 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
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and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. 

W Further preferred is an isolated antibody which binds specifically to a 

polypeptide comprising an amino acid sequence that is at least 90% identical to a 
5 sequence of at least 1 0 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence ofSEQ ID NO: Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 

20 * 10 1. 

Further preferred is a method for detecting in a biological sample a 
polypeptide comprising an amino acid sequence which is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Tabic 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDN A clone identified by a cDN A Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
I ; which method comprises a step of comparing an amino acid sequence of at least 
one polypeptide molecule in said sample with a sequence selected from said group 
and determining whether the sequence of said polypeptide molecule in said sample is 
35 31 Icasl 90% identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of 
polypeptides in said sample to an antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: 
45 an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in 

Table 1 ; and a complete amino acid sequence of a protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Tabic t and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table L 



25 

15 

30 

20 



40 

25 



30 



55 



WO 00/06698 



302 



PCT/US99/17130 



Also preferred is the above method wherein said step of comparing sequences 
is performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting polypeptide molecules 
in said sample, if any, comprising an amino acid sequence that is at least 90% 
identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is 
any integer as defined in ^able 1; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in - 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

Also preferred is the above method for identifying the species, tissue or cell 
type of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table 1, which method comprises a step of detecting in a 
biological sample obtained from said subject polypeptide molecules comprising an 
amino acid sequence in a panel of at least two amino acid sequences, wherein at least 
one sequence in said panel is at least 90% identical to a sequence of at least 10 
contiguous amino acids in a sequence selected from the group consisting of; an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

In any of these methods, the step of detecting said polypeptide molecules 
includes using an antibody. 
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Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at 
least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO: Y 
wherein Y is any integer as defined in Table 1 ; and a complete amino acid sequence 
of a secreted protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide- 
sequence encoding a polypeptide has been optimized for expression of said 
polypeptide in a prokaryotic host. 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 
comprises an amino acid sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table I; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1, 

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecule into a vector. Also preferred 
is the recombinant vector produced by this method. Also preferred is a method of 
making a recombinant host cell comprising introducing the vector into a host cell, as 
well as the recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such that said polypeptide is 
expressed and recovering said polypeptide. Also preferred is this method of making 
an isolated polypeptide, wherein said recombinant host cell is aeukaryotic cell and 
said polypeptide is a secreted portion of a human secreted protein comprising an 
amino acid sequence selected from the group consisting of: an amino acid sequence of 
SEQ ID NO:Y beginning with the residue at the position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and 
said position of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y is 
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defined in Table 1 ; and an amino acid sequence of a secreted portion of a protein 
encoded by a human cDNA clone identified by a cDN A Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. The isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an 
increased level of a secreted protein acuvity, which method comprises administering 
to such an individual a pharmaceutical composition comprising an amount of an 
isolated polypeptide, polynucleotide, or antibody of the claimed invention effective to 

increase the level of said protein activity in said individual. 

4 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of ' 
illustration and are not intended as limiting. 



Examples 

Example 1; Isolation of a Selected cDNA C lone From the Denosil«»H S^ ptf 

Each cDNA clone in a cited ATCC deposit is contained in a piasmid vector. 
Table 1 identifies the vectors used to construct the cDNA library from which each 
clone was isolated. In many cases, the vector used to construct the library is a phage 
vector from which a piasmid has been excised. The table immediately below 
correlates the related piasmid for each phage vector used in constructing the cDNA 
library. For example, where a particular clone is identified in Table I as being 
isolated in the vector "Lambda Zap," the corresponding deposited clone is in 
"pBluescript." 

Vector Used to Construct Library Corresponding Deposited 

Piasmid 

Lambda Zap pBluescript (pBS) 

Uni-Zap XR pBluescript (pBS) 

Zap Express pBK 
UfmidBA plafmid BA 

pSportl pSportl 
pCMVSport 2.0 pCMVSport 2.0 
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pCMVSport 3.0 
pCR*2.1 



pCMVSport 3.0 
pCR e 2.I 



Vectors Lambda Zap (U.S. Patent Nos. 5,128256 and 5,286,636). Uni-Zap 

XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 

5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 

16:7583-7600 (1988); Alting-Mecs, M A. and Short, J. M., Nucleic Acids Res. 

17:9494 (1989)) and pBK (Alting-Mees, M. A. et ah, Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 1 1011 N. Torrey 
Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Both can be transformed into E. coli strain 
XL-1 Blue, also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS-f 
and KS. The S and K refers to the orientation of the polylinker to the T7 and T3 
primer sequences which flank the polylinker region ("S M is for Sad and "K" is for 
Kpnl which are the first sites on each respective end of the linker). "+" or "-" refer to 
the orientation of the fl origin of replication fori"), such that in one orientation, 
single stranded rescue initiated from the f 1 ori generates sense strand DNA and in the 
other, antisense. 

Vectors pSport 1 , pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
contain an ampicillin resistance gene and may be transformed into E. coli strain 
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, 
NY) contains an ampicillin resistance gene and can be transformed into E. coli strain 
XL-1 Blue. Vector pCR®2.l, which is available from Invitrogen, 1600 Faraday 
Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DH10B, available from Life Technologies. (See, for 
instance, Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D, et al., 
Bio/Technology 9: (1991).) Preferably, a polynucleotide of the present invention 
does not comprise the phage vector sequences identified for the particular clone in 
Table I, as well as the corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number 
cited in Table 1 for any given cDNA clone also may contain one or more additional 
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plasmids, each comprising a cDNA clone different from that given clone. Thus, 
deposits sharing the same ATCC Deposit Number contain at least a plasmid for each 
cDNA clone identified in Table 1. Typically, each ATCC deposit sample cited in 
Table 1 comprises a mixture of approximately equal amounts (by weight) of about 50 
plasmid DN As, each containing a different cDNA clone; but such a deposit sample 
may include plasmids for more or less than 50 cDNA clones, up to about 500 cDNA 
clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ 
IDNOrX. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with "P-y-ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL- 1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 15% agar plates (containing the appropriate selection 
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
1.104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5* NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried oui under routine conditions, for instance, in 25 \l\ of reaction 
mixture with 0.5 ug of the above cDNA template. A convenient reaction mixture is 
1 .5-5 mM MgCi_„ 0.01 % (w/v) gelatin. 20 uM each of dATP, dCTP, dGTP, dTTP, 25 
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pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for i 
min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The 
amplified product is analyzed by agarose gel electrophoresis and the DNA band with 
expected molecular weight is excised and purified. The PCR product is verified to be 
the selected sequence by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5* or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well 
known in the art. For instance, a method similar to 5' RACE is available for 
generating the missing 5' end of a desired full-length transcript. (Fromont-Racine et 
al., Nucleic Acids Res. 21(7): 1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5" ends of a 
population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the gene of interest is used to PCR amplify 
the 5' portion of the desired full-length gene. This amplified product may then be 
sequenced and used to generate the full length gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should 
then be inactivated and the RNA treated with tobacco acid pyrophosphatase in order 
to remove the cap structure present at the 5' ends of messenger RNAs; This reaction 
leaves a 5* phosphate group at the 5' end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligasc. 

This modified RNA preparation is used as a template for first strand cDN A 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific 
to the ligated RNA oligonucleotide and a primer specific to the known sequence of 
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the gene of interest The resultant product is then sequenced and analyzed to confirm 
that the 5' end sequence belongs to the desired gene. 

Example 2: Isolation of Geno mic Clones Corresponding to a Polvnuy ipnfiHp 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X., 
according to the method described in Example 1 . (See also, Sambrook.) 

Example 3 : Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern biot analysis, described by, 
among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example I is labeled with P" using the rediprime™ DNA labeling 
system ( Amersham Life Science), according to manufacturer's instructions. After 
labeling, the probe is purified using CHROMA SPIN- 100™ column (Clontech 
Laboratories. Inc.), according to manufacturer's protocol number PT1200-L The 
purified labeled probe is then used to examine various human tissues for mRNA 
expression 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) 
or human immune system tissues (IM) (Clontech) are examined with the labeled 
probe using Express Hyb™ hybridization solution (Clontech) according to 
manufacturer's protocol number PT1 190-1. Following hybridization and washing, the 
blots are mounted and exposed to film at -70°C overnight, and the films developed 
according to standard procedures. 

Example 4: Chrom osomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions : 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is 
repeated 32 times followed by one 5 minute cycle at 70°C. Human, mouse, and 
hamster DNA is used as template in addition to a somatic cell hybrid panel containing 
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individual chromosomes or chromosome fragments (Bios, Inc), The reactions is 
analyzed on either 8% polyacryiamide gels or 3.5 % agarose gels. Chromosome 
mapping is determined by the presence of an approximately 100 bp PCR fragment in 
the particular somatic cell hybrid. 

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3* ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and Xbal, at the 5' end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and Xbal correspond to the 
restriction enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., 
Chatsworth, CA). This plasmid vector encodes antibiotic resistance (Amp r ), a 
bacterial origin of replication (ori), an IPTG-regulatable promoter/operator (P/O), a 
ribosome binding site (RBS), a 6-histidine tag (6-His). and restriction enzyme cloning 
sites. 

The pQE-9 vector is digested with BamHI and Xbal and the amplified 
fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at 
the bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which 
expresses the lacl repressor and also confers kanamycin resistance (Kan r ). 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies are selected. Plasmid DNA is isolated and 
confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp { 100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1 : 100 to 1 :250. The 
cells are grown to an optical density 600 (CD. 6 ™) of between 0.4 and 0.6. IPTG 
(Isopropy!-B-D-ihiogalacto pyranostde) is then added to a final concentration of 1 
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mM. IPTG induces by inactivating the lacl repressor, clearing the P/O leading to 
increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrirugation (20 rains at 6000Xg). The cell pellet is solubilized in the chaotropic 
agent 6 Molar Guanidine HCI by stirring for 3-4 hours at 4°C. The cell debris is 
removed by centrifugaiion, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-iri-acetic acid ("Ni-NTA") affinity resin column (available from 
Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with 
high affinity and can be^purified in a simple one-step procedure (for details see: The 
QIAcxpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCI, pH 
8, the column is first washed with 10 volumes of 6 M guanidine -I ICJ, pH 8, then 
washed with 10 volumes of 6 M guanidine-HCI pH 6, and finally the polypeptide is 
eluted with 6 M guanidine-HCI, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate- 
buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the Ni- 
NTA column. The recommended conditions are as follows: renature using a linear 
6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 
mM irnmidazole. Immidazole is removed by a final dialyzing step against PBS or 50 
mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 
4° Cor frozen at -80° C 

In addition to the above expression vector, the present invention further 
includes an expression vector comprising phage operator and promoter elements 
operatively linked to a polynucleotide of the present invention, called pHE4a. (ATCC 
Accession Number 209645, deposited on February 25, 1998.) This vector contains: 
1) a neomycinphosphotransferase gene as a selection marker. 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 
Shine-Delgarno sequence, and 6) the lactose operon repressor gene (lacjq). The 
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origin of replication (oriC) is derived from pUCl9 (LTI, Gaithcrsburg, MD). The 
promoter sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal. BamHI, XhoL or Asp718, running the restricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 3 10 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for Ndel (5" primer) and Xbal, BamHI, Xhol, or 
Asp7 18 (3* primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 6: Purifica tion of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide 
expressed in E coli when it is present in the form of inclusion bodies. Unless 
otherwise specified, all of the following steps are conducted at 4- 10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10^ and the cells harvested by continuous centrifugation at 
15.000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mlvl 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 
overnight to allow further GuHCl extraction. 
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Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA 
by vigorous stirring. The refolded diluted protein solution is kept at 4°C without 
mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0. 16 urn membrane filter with appropriate surface area 
(e.g., Filtron). equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted . 
with 250 mM. 500 mM. 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak 
anion (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are 
equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 
mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using 
a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are 
collected under constant A 280 monitoring of the effluent. Fractions containing the 
polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands should be 
observed from Commassie blue stained 16% SDS-PAGE gel when 5 jug of purified 
protein is loaded. The purified protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL 
assays. 
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Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, (he plasmid shuttle vector pA2 is used to insert a 
polynucleotide into a baculovirus to express a polypeptide. This expression vector 
contains the strong polyherinn promoter of the Autographa calif ornica nuclear 
polyhedrosis virus (AcMNPV) followed by convenient restriction sites such as 
BamHI, Xba I and Asp718. The polyadenylation site of the simian virus 40 CSV40") 
is used for efficient polyadenylation. For easy selection of recombinant virus, the 
plasmid contains the beta-galactosidase gene from E. colt under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadenylation signal 
of the polyhedrin gene. The inserted genes are flanked on both sides by viral * 
sequences for cell-mediated homologous recombination with wild-type viral DNA to 
generate a viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, 
as long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
required. Such vectors are described, for instance, in Luckow et al.. Virology 170:31- 
39(1989). 

Specifically, the cDNA sequence contained in the deposited clone, including 
the AUG initiation codon and the naturally associated leader sequence identified in 
Table I T is amplified using the PCR protocol described in Example I. If the naturally 
occurring signal sequence is used to produce the secreted protein, the pA2 vector does 
not need a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in 
Summers et al M "A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures/' Texas Agricultural Experimental Station Bulletin No. 1555 
(1987). 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit CGeneclean," BIO 101 Inc., La Joila, Ca.). The fragment 
then is digested with appropriate restriction enzymes and again purified on a \% 
agarose gel. 
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The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the an. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are tigated together with T4 
DNA iigase. E. coli HB 101 or other suitable E. coti hosts such as XL- 1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid arc identified 
by digesting DNA from individual colonies and analyzing the digestion product by 
gel electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five n.g of a plasmid containing the polynucleotide is co-transfected with 1.0 
Hg of a commercially available linearized baculovirus DNA ("BaculoGold™ 
baculovinis DNA", Pharmingen, San Diego, CA), using the lipofection method 
described by Feigner et ah, Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ug 
of BaculoGold™ virus DNA and 5 fig of the plasmid are mixed in a sterile well of a 
microtitcr plate containing 50 u\l of serum-free Grace's medium (Life Technologies 
Inc., Gaithersburg, MD). Afterwards, 10 u:l Lipofectin plus 90 u.1 Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to Sf9 insect cells (ATCC CRL 171 1) seeded 
in a 35 mm tissue culture plate with 1 ml Grace's medium without serum. The plate is 
then incubated for 5 hours at 27° C. The transfection solution is then removed from 
the plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. Cultivation is then continued at 27° C for four days. 

After four days the supernatant is collected and a plaque assay is performed, 
as described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-cxpressing clones, which produce blue-stained plaques. (A detailed description of 
a "plaque assay" of this type can also be found in the user's guide for insect cell 
culture and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 
9-10.) After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
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resuspended in a microcentrifuge tube containing 200 |jj of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded 
in 35 mm dishes. Four days later the supernatams of these culture dishes are 
harvested and then they are stored at 4° C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 
infection ("MOI") of about 2. If radiolabeled proteins arc desired, 6 hours later the 
medium is removed and is replaced with SF900 II medium minus methionine and 
cysteine (available from Life Technologies Inc., Rockville, MD). After 42 hours, 5. 
HCi of 35 S-methionine and 5 uCi 35 S-cysteine (available from Amersham) are added. 
The cells are further incubated for 16 hours and then are harvested by centrifugatioa 
The proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
produced protein. 

Example 8: Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a mammalian 
cell. A typical mammalian expression vector contains a promoter element, which 
mediates the initiation of transcription of mRNA, a protein coding sequence, and 
signals required for the termination of transcription and polyadenylation of the 
transcript. Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from S V40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLVT. HIVI and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be 
used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2chfr (ATCC 37146), pBC12MI (ATCC 67109), 



WO 00/06698 



316 



PCT/US99/17U0 



pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and CI 27 cells, Cos 1, 
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing 
the polynucleotide integrated into a chromosome. The co-transfection with a 
selectable marker such as dhfr, gpt, neomycin, hygromycin allows the identification 
and isolation of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
ceil lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt. F. W., et al.. J. Biol. Chem. 253: 1357-1370 (1978); Hamlin, J. 
L. and Ma, C, Biochem. et Biophys. Acta, 1097: 107- 143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 ( 199 1 ).) Another useful selection marker 
is the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 
(1991); Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, 
the mammalian cells are grown in selective medium and the cells with the highest 
resistance are selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al.. Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhanccr (Boshart ct al., Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, Xbal and Asp7l8, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a I % agarose gel. 
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A polynucleotide of the present invention is amplified according to the 
protocol outlined in Example 1. If the naturally occurring signal sequence is used to 
produce the secreted protein, the vector does not need a second signal peptide. 
Alternatively, if the naturally occurring signal sequence is not used, the vector can be 
5 modified to include a heterologous signal sequence. (See, e.g., WO 96/34891.) 
The amplified fragment is isolated from a 1 % agarose gel using a 
15 cornrnercially available kit CGeneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 

then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

*0 The amplified fragment is then digested with the same restriction enzyme and 

purified on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. E. coli HB 101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
25 pC6 using, for instance, restriction enzyme analysis. 

15 Chinese hamster ovary cells lacking an active DHFR gene is used for 

transfection. Five u.g of the expression plasmid pC6 is cotransfected with 0.5 u.g of 
the plasmid pS Vneo using lipofectin (Feigner et al„ supra). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 
that confers resistance to a group of antibiotics including G418. The cells are seeded 
20 in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 
petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM. 
40 25 100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 

methotrexate are then uansfcrred to new 6-well plates containing even higher 
concentrations of methotrexate ( 1 uM. 2 jlM, 5 u:M, 10 mM ( 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 100 - 
200 u.M. Expression of the desired gene product is analyzed, for instance, by SDS- 
30 PAGE and Western blot or by reversed phase HPLC analysis. 

50 Example 9: Protein Fnsitms 
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The polypeptides of the present invention are preferably fused to other 
proteins. These fusion proteins can be used for a variety of applications. For 
example, fusion of the present polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 5; see 
also EP A 394,827; Traunecker, et al., Nature 33 1:84-86 (1988).) Similarly, fusion to 
IgG-1, IgG-3, and albumin increases the halfiife time in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention can target the protein to a 
specific subcellular localization, while covalent hctcrodimer or homodimers can 
increase or decrease the activity of a fusion protein. Fusion proteins can also create 
chimeric molecules having more than one function. Finally, fusion proteins can 
increase solubiliry and/or stability of the fused protein compared to the non-fused 
protein. AH of the types of fusion proteins described above can be made by 
modifying the following protocol, which outlines the fusion of a polypeptide to an 
IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 
using primers that span the 5' and 3' ends of the sequence described below. These 
primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion 
can be ligated into the BamHI cloning site. Note that the 3* BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PCR protocol described in Example I, is ligated into this BamHI site. Note 
that the polynucleotide is cloned without a stop codon, otherwise a fusion protein will 
not be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 



Human IgG Fc region: 
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GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGQCCACCGTGC 

CCAGCACCTX3AATTCGAGGGTGCACCGTCAGTCTTCCTCITCCCCCCA^ 

CCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGT 

GGTGGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGG 

ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTA 

CAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACT 

GGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCA 

ACCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 

CACAGGTGTACACCCJGCCCCCATCCCGGGATGAGCTGACCAAGAACCAG 

GTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGX 

GGAGTGOGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCr 

CCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTG 

GACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCA 

TGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

GTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID NO:l) 

Example 10; Production of an Antib o dy from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing a 
polypeptide of the present invention is administered to an animal to induce the 
production of sera containing polyclonal antibodies. In a preferred method, a 
preparation of the secreted protein is prepared and purified to render it substantially 
free of natural contaminants. Such a preparation is then introduced into an animal in 
order to produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et a!., Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (198 1).) In genend, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted poiypeptide-expressing cell. Such cells may be cultured in 
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any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
about 56°C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 ug/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma 
cells obtained through such a selection are then assayed to identify clones which 
secrete antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an anLibody 
whose ability to bind to the protein-specific antibody can be blocked by the 
polypeptide. Such antibodies comprise anti-idiotypic antibodies to the protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods disclosed 
herein. Such fragments are typically produced by proteolytic cleavage, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab)2 
fragments). Alternatively, secreted protein-binding fragments can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
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using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are known in 
the art. (See, for review, Morrison, Science 229:1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Cabilly et ah, U.S. Patent No. 4,816,567; Taniguchi et 
al, EP 171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson 
et al., WO 8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Mature 
314:268(1985).) 

Example 1 1 : Production Of Secreted Protein Fnr H teh-Throughmii Scr«Pnin P 
Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(Img/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for 
a working solution of 50ug/ml. Add 200 ul of this solution to each well (24 well 
plates) and incubate at RT for 20 rninules. Be sure to distribute the solution over each 
well (note: a 12-channeI pipetter may be used with tips on every other channel). 
Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered 
Saline). The PBS should remain in the well until just prior to plating the cells and 
plates may be poly-lysine coated in advance tor up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10 5 cells/well in .5ml 
DMEM(Dulbecco\s Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 
(12-604FBiowhittaker))/L0% heat inactivated FBS(14-503F Biowhittaker)/lx 
Penstrep( 17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofcctamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an 
expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 8 or 9, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter, add 50ul of the Lipofectamine/Optirnem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 15-45 
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minutes. After about 20 minutes, use a multi-channel pipetter to add 150ul Optimem 
I to each well. As a control, one plate of vector DNA lacking an insert should be 
transfected with each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cur in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates 
of cells, and then person B rinses each well with .5- lml PBS. Person A then aspirates 
off PBS rinse, and person B, using al2-channel pipetter with tips on every other 
channel, adds the 200ul o^DNA/Lirx)fectaniine/Optimem I complex to the odd wells 
first, then to the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 
hours. 

While cells are incubating, prepare appropriate media, either 1 %BS A in 
DMEM with lx penstrep, or CHO-5 media (1 16.6 mg/L of CaC12 (anhyd); 0.00130 
mg/L CuSCVSH.O; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of FeSO 4 -7H 2 0; 
3 11 .80 mg/L of Kcl; 28.64 mg/L of MgCL; 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of 
NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L of NaH 2 PO 4 -H,0; 71.02 mg/L of 
NajHP04; .4320 mg/L of ZnS0 4 -7H,0; .002 mg/L of Arachidonic Acid ; 1 .022 mg/L 
of Cholesterol; .070 mg/L of DL-alpha-Tocopherol- Acetate; 0.0520 mg/L of Linoteic 
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of 
Oleic Acid; 0.010 mg/L of Pairaitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of D- 
Glucose; 130.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L-Asparagine-H 2 0; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine- 
2HCL-H 2 0; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutarnic Acid; 365.0 
mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL- 
H 2 0; 106.97 mg/ml of L-Isoleucine; 1 11.45 mg/ml of L-Leucinc; 163.75 mg/ml of L- 
Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 
mg/ml of L-Proline; 26.25 rag/mi of L-Serine; 101.05 mg/ml of L-Threonine; 19.22 
mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H : 0; 99.65 mg/ml of L- 
Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1 .78 mg/L of 
Choline Chloride; 4.65 mg/L of Folic Acid; 1 5.60 mg/L of i-Inositol; 3.02 mg/L of 
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.03 1 mg/L of Pyridoxine HCL; 0.3 19 
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mg/L of Riboflavin; 3. 17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B I2 ; 25 mM of HEPES Buffer; 2.39 mg/L of Na 
Hypoxanthine; 0. 105 mg/L of Lipoic Acid; 0.08 1 mg/L of Sodium Putrescine-2HCL; 
55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of 
Ethanolamine; 0.122 mg/L of Ferric Ciirate; 41.70 mg/L of Methyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed 
with Oleic Acid; and 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal) 
with 2mm glutamine and lx penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L 
DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15ml polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end 
of the incubation period. Person A aspirates off the transfection media, while person 
B adds 1 .5ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1%BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml 
deep well plate and the remaining supernatant into a 2ml deep well. The supernatants 
from each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the 
polypeptide directly (e.g., as a secreted protein) or by the polypeptide inducing 
expression of other proteins, which are then secreted into the supernatant. Thus, the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay. 

Example 12; Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive responsive element ("ISRE"), located in the promoter of many genes. The 
binding of a protein to these elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs." There are six 
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members of the STATs family. Stat I and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL- 12. StatS was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in 
tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 
upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent ^ distinct family of soluble tyrosine kinases and include Tyk2, 
JakI, Jak2. and Jak3. These kinases display significant sequence similarity and arc 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:62 1 - 
51 (1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, 1L-9, IL-1 1, IL- 
12, IL-1 5, Epo, PRL. GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) 
Class 2 includes IFN-a, IFN-g, and 1L-10. The Class I receptors share a conserved 
cysteine motif (a set of four conserved cysteines and one tryptophan) and a WSXWS 
motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines 
are known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using 
GAS elements linked to reporter molecules, activators of the Jaks-STATs pathway 
can be identified. 
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JAKs 

Ueand tvk2 JakI Jak2 JaJc3 

I FN family 



IFN-a/B 




+ 




IFN-g 




+ 


+ 


D-10 


+ 


9 




CD 130 familv 








IL-6 (Pleiotrophic) 


+ 


+ 


+ ? 


IMl(Pleiotrophic) 


7 




? ? 


OnM(Pleiotrophic) 


o 




+ ? 


LIF(Pleiotrophic) 


? 


+ 


+ ? 


CNTF(Pleiotrophic) 


-/+ 




+ ? 


G-CSF(Pleiotrophic) 


*> 


+ 


? ? 


IL-12(Pleiotrophic) 


+ 




+ + 


s-C familv 








IL-2 (lymphocytes) 




+ 


+ 


IL-4 (lymph/mycloid) 




+ 


+ 


IL-7 (lymphocytes) 




+ 


+ 


IL-9 (lymphocytes) 




+ 


+ 


IL-13 (lymphocyte) 




+ 


? ? 


IL-15 


? 


+ 


? + 


SD140 familv 









IL-3 (myeloid) + 
IL-5 (myeloid) + 
GM-CSF (myeloid) - - + 



Growth hormone family 
GH ? 



STATS GASrelementsl nr FSRR 

1,23 ISRE 

1 GAS (IRFl>Lys6>IFP) 

1,3 

1,3 GAS (IRFl>Lys6>IFP) 



1,3 
1,3 
1.3 
1,3 
1,3 
1,3 



1,3,5 GAS 

6 GAS (IRF1 = [FP »Ly6)(IgH) 

5 GAS 

5 GAS 

6 GAS 
5 GAS 

5 GAS (IRFl>[FP»Ly6) 

5 GAS 

5 GAS 



5 
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PRL ? +/-, + 

EPO ? - + 

Receptor Tyrosine Kinares 

EGF ? + + 

PDGF ? + + 

CSF-1 ? + + 



1*3,5 

5 G AS(B-CAS>IRF I =IFP»Ly 6) 

1,3 GAS (IRFl) 
1,3 

1,3 GAS (not IRFl) 
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To construct a synthetic GAS containing promoter element which is used in 
the Biological Assays described in Examples 13-14, a PCR based strategy is 
employed to generate a GAS-SV40 promoter sequence. The 5' primer contains four 
tandem copies of the GAS binding site found in the IRF1 promoter and previously 
demonstrated to bind STATs upon induction with a range of cytokines (Roth man et 
at.. Immunity 1:457-468 (1994).), although other GAS or ISRE elements can be used 
instead. The 5' primer also contains 18bp of sequence complementary to the SV40 
early promoter sequence and is flanked with an Xhol site. The sequence of the 5' 
primer is: 

3? 

5\GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCC(:C 
G A A ATG ATTTCCCCG A A AT ATCTGCCATCTC AATTAG : 3 ' (SEQ ID NO:3)' 

The downstream primer is complementary to the S V40 promoter and is 
flanked with a Hind IH site: 5 *:GCGGCA AGCTTTTTGCAAAGCCTAGGC:3* 
(SEQ ID NO:4) 

PCR amplification is performed using the S V40 promoter template present in 
the B-gal: promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloncd into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

^''CTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAA 

TGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCG 

CCCCTAACTCCGCCCATCTCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 

CCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCC 

TCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCT 

AGGCTTTTGC A A A AAG£ET: 3 * (SEQIDNO:5) 

With this GAS promoter element linked to the S V40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted 
alkaline phosphatase, or "SEAP.** Clearly, however, any reporter molecule can be 
instead of SEAP, in this or in any of the other Examples. Well known reporter 
molecules that can be used instead of SEAP include chloramphenicol 
acetyltransferase (CAT), luciferase, alkaline phosphatase, B-gaiactosidase, green 
fluorescent protein (GFPh or any protein detectable by an antibody. 
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The above sequence confirmed synthetic GAS-SV40 promoter. element is 
subcloned into the pSE AP-Promoter vector obtained from Clontech using Hindm and 
Xhoi, effectively replacing the SV40 promoter with the amplified GAS:SV40 
promoter element, to create the G AS-SEAP vector. However, this vector does not 
contain a neomycin resistance gene, and therefore, is not preferred for mammalian 
expression systems. 

Thus, in order to generate mammalian stable cell lines expressing the G AS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
Sail and Notl, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP- 1 (Clontech). using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NFK.-B and EGR promoter sequences are described in Examples 1 5 and 
16. However, many other promoters can be substituted using the protocols described 
in these Examples. For instance, SRJE, IL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, 11- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUV AC (aortic), or Cardiomyocyte. 

Example 13: High -Throughput Screening Assay for T-cetl Activity. 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells. T- 
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks- 
STATS signal transduction pathway. The T-cell used in this assay is Jurkat T-cells 
(ATCC Accession No. TTB-152), although Molt-3 cells (ATCC Accession No. CRL- 
1552) and Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 
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Jurfcat T-cells are lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the G AS- 
SEAP/neo vector using DMRE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and transfectants resistant to 1 mg/ml gcnticin selected. 
Resistant colonies are expanded and then tested for their response to increasing 
concentrations of interferon gamma. The dose response of a selected clone is 
demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to • 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% scrum with l%Pen-Strep. Combine 25 mis of OPTI-MEM (Life 
Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM 
containing 50 ul of DMRIE-C and incubate at room temperature for 1 5-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10 7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10 7 cells/ml. Then add Iml of 1 x 10 7 cells in OPTI-MEM to T25 
flask and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI +- 15% 
serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI +- 10% 
serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with 
supematants containing a polypeptide as produced by the protocol described in 
Example 11. 

On the day of treatment with the supernatant, the ceils should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supematants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells 
into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 
200 ui of cells into each well (therefore adding 100. 000 cells per well). 
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After all the plates have been seeded, 50 ul of the supematants are transferred 
directly from the 96 well plate containing the supematants into each well using a 12 
1Q channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1 .0, 10 ng) 

is added to wells H9, HI0, and HI 1 to serve as additional positive controls for the 

5 assay. 

The 96 well dishes containing iurkat cells treated with supematants are placed 
15 in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul 

samples from each well are then transferred to an opaque 96 well plate using a 12 
channel pipette. The opapue plates should be covered (using sellophene covers) and 
20 10 stored at -20°C until SEAP assays are performed according to Example 17. The - 

plates containing the remaining treated cells are placed at 4°C and serve as a source 
of material for repeating the assay on a specific well if desired. 

As a positive control, 1 00 Unit/ml interferon gamma can be used which is 

25 

known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
15 positive control wells. 

The above protocol may be used in the generation of both transient, as well as, 
stable transfected cells, which would be apparent to those of skill in the art. 

Example 14: Hi ph -Through put Screening Assay Identifying Myeloid Activity 

20 The following protocol is used to assess myeloid activity by identifying 

35 factors, such as growth factors and cytokines, that may proliferate or differentiate 

myeloid cells. Myeloid cell activity is assessed using the GAS/SEAP/Neo construct 
produced in Example 12. Thus, factors that increase SEAP activity indicate the 
ability to activate the Jaks-STATS signal transduction pathwav. The myeloid cell 

40 

25 used in this assay is U937, a pre-monocyte cell line, although TF-! , HL60, or KG1 
can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct 
45 produced in Example 12, a DEAE-Dextran method (Kharbanda et. al., 1994, Cell 

Growth & Differentiation, 5:259-265) is used First, harvest 2xl0e 7 U937 cells and 
30 wash with PBS. The U937 cells are usually grown in RPMI 1640 medium containing 

50 
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10% heat-inactivated fetal bovine serum (FBS) supplemented with 100 units/mi 
penicillin and 100 mg/ml streptomycin. 

Next, suspend the ceils in 1 ml of 20 mM Tris-HCI (pH 7.4) buffer containing 
0.5 mg/ml DEAE-Dcxtran. 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KCl 375 uM Na 2 HPO 4 .7H20, 1 mM MgCl2, and 675 uM CaCI 2 - Incubate at 37°C 
for 45 min. 

Wash the ceils with RPMI 1640 medium containing 10% FBS and then 
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the ceils in 400 
ug/ml G418. The G4l8-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting 1x10* cells (this is enough for ten 96-well 
plates assay) and wash with PBS. Suspend the cells in 200 ml above described 
growth medium, with a final density of 5xt0 5 celis/ml. Plate 200 ul cells per well in 
the 96-well plate (or U10* cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 
II. Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant 
according to the protocol described in Example 17. 

Example 15: High-T hroughput Screening Assay Identifying Neuronal Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene I ), is induced in various tissues and cell types 
upon activation. The promoter of EGR 1 is responsible for such induction. Using the 
EGR1 promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). 
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The EGR1 gene expression is activated during this treatment. Thus, by stably 
transfecting PC 12 cells with a construct containing an EGR promoter linked to SEAP 
reporter, activation of PC 12 cells can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following 
protocol. The EGR- 1 promoter sequence (-633 to +l)(Sakamoto K et al.. Oncogene 
6:867-871 (1991)) can be PCR amplified from human genomic DNA using the 
following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID NO:6) 
5' GCGAAGCTyCGCGACTCCCCGGATCCGCCTC-3 , (SEQ ID NO:7) 
Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified 
product can then be inserted into this vector. Linearize the GAS:SEAP/Nco vector 
using restriction enzymes Xhol/HindM, removing the GAS/S V40 stuffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1:30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30%ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96-well plate, and 
allowed to air dry fcr 2 hr. 

PC 12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 
100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is 
done every three to four days. Cells arc removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC 12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable ceils are obtained by 
growing the cells in 300 ug/ml G418. The G4 1 8-frcc medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
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(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as 5x10^ 
ceUs/mL 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
IxlO 5 cells/well). Add 50 ul supernatant produced by Example 1 1, 37°C for 48 to 72 
hr. As a positive control, & growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold * 
induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Exam ple Ifo High -Through put Screening Assay fnr T-r pII Artlv j fy 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide 
variety of agents including the inflammatory cytokines IL-1 and TNF, CD30 and 
CD40, lymphotoxin-alpha and lymphotoxin-bcta, by exposure to LPS or thrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-icB 
regulates the expression of genes involved in immune cell activation, control of 
apoptosis (NF- kB appears to shield cells from apoptosis), B and T-cell development, 
anti -viral and antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 
causing NF- kB to shuttle to tlie nucleus, thereby activating transcription of target 
genes. Target genes activated by NF- kB include IL-2, TL-6, GM-CSF, ICAM-l and 
class 1 MHC. 

Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 11. Activators or inhibitors of NF-kB would be useful in 
treating diseases. For example, inhibitors of NF-kB could be used to treat those 
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diseases related to the acute or chronic aciivaiion of NF-kB, such as rheumatoid 
arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 
to the 5* end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
5 * :<X!GGCCTCG AGGGG ACTTTCCCCH3GG ACTTTCCGGGG ACT 
TTTCCATCCTGCC ATCTC A ATT AG:3 ' (SEQ ID NO:9) 

The downstrearr^primer is complementary to the 3' end of the S V40 promoter 
and is flanked with a Hind III site: 

5 , ;GCGGCAAGCTTTTTGCAAAGCCTAGGC:3 , (SEQ ID NO:4) 

PCR amplification is performed using the S V40 promoter template present in 
the pB-gal:promoter plasmid obtained from Clontcch. The resulting PCR fragment is 
digested with Xhol and Hind in and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

S^CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTITCCGGGACTTTCC 
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCC 
ATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGA 
CTAAl 1 1 1 1 1 1 l ATTT ATGC AG AGGCCG AGGCCGCCTCGGCCTCTGAGCTA 
TTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAA 
GCTT:3* (SEQ ID NO: 10) 

Next, replace the SV40 minimal promoter element present in the pSEAP2- 
promoter plasmid (Clontcch) with this NF-KB/SV40 fragment using Xhol and 
HindlH. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-kB/S V40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes 
Sail and NotI, and inserted into a vector containing neomycin resistance. Particularly, 
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the NF-xB/S V40/SEAP cassette was insened into pGFP- 1 (Clontech), replacing the 
GFP gene, after restricting pGFP-I with Sail and Notl. 

Once NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 13. Similarly, 
the method for assaying supernatants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 ng) is added to 
wells H9, H10, and HI 1, with a 5-10 fold activation typically observed. 

Example 17: Assay foj S EAP Activity 

As a reporter molecule for the assays described in Examples 1 3-16, SEAP . 
activity is assayed using the Tropix Phospho-light Kit (Cat. DP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 pj of 2.5x 
dilution buffer into Optiplates containing 35 fxl of a supernatant. Seal the plates with 
a plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid 
uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser 
and prime with the Assay Buffer. Add 50 uJ Assay Buffer and incubate at room 
temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 pi Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at 
each time and start the second set 10 minutes later. 

Read the relative light unit in the luminometer. Set HI2 as blank, and print 
the results. An increase in chemiluminescence indicates reporter activity. 

Reaction Buffer Formulation: 

# of plates Ran buffer dilueni (ml) CSPD(ml) 

~I0 60 3 

1 1 65 3.25 

12 70 3.5 
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15 



20 



25 
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35 



40 



45 
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75 


3.75 


14 


80 
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15 


85 


4.25 


16 


90 
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95 


4.75 


18 


100 
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19 


105 


5.25 


20 


110 


5.5 


21 


115 


5.75 


22 


120 
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23 


125 


6.25 
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130 


6.5 
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135 


6.75 


26 


140 
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27 


145 


7.25 


28 


150 


7J 


29 


155 


7.75 


30 


160 
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31 


165 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 


180 
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35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 


39 


205 


10.25 


40 


210 


10.5 


41 


215 


10.75 


42 


220 


11 


43 


225 


11.25 


44 


230 


11.5 


45 


235 


11.75 


46 


240 


12 


47 


245 


12.25 


48 


250 


12.5 


49 


255 


12.75 


50 


260 


13 
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Example 18: High-Throughput S creening Assay Identifying Chang e s in Small 
Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supematants 
which bind to receptors of a particular cell. Although the following protocol 
describes an assay for calcium, this protocol can easily be modified to detect changes 
in potassium, sodium, pFj, membrane potential, or any other small molecule which is 
detectable by a fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") lo ' 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-4 (Molecular Probes, irxi. ; 
catalog no. F-14202) , used here. 

For adherent cells, seed the cells at 10.000 -20,000 cells/well in a Co-star 
black 96-well plate with clear bottom. The plate is incubated in a CO, incubator for 
20 hours. The adherent cells are washed two times in Biotek washer with 200 ul of 
HBSS (Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To 
load the cells with fluo-4 , 50 ui of 12 ug/ml (luo-4 is added to each well The plate 
is incubated at 37°C in a C0 2 incubator for 60 min. The plate is washed four times in 
the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5xlO G cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 
fluo-4 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 
The tube is then placed in a 37°C water bath for 30-60 nun. The ceils are washed 
twice with HBSS, resuspended to lx 10* cells/mi, and dispensed into a microplate, 100 
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed 
once in Denley Ceil Wash with 200 ul, followed by an aspiration step to 100 ul final 
volume. 
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For a non-cell based assay, each well contains a fluorescent molecule, such as 
flur>4 . The supernatant is added to the well, and a change in fluorescence is 
detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters; (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; 
and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an 
extracellular signaling event which has resulted in an increase in the intracellular 
Ca" 1 " 1 " concentration . 

Example 19: High-Throughp ut S creening Assay Identifying Tyrosine Ki n m 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range of mitogenic and metabolic growth 
factors including the PDGF, FGF. EGF, NGF, HGF and Insulin receptor subfamilies. 
In addition there are a large family of RPTKs for which the corresponding iigand is 
unknown. Ligands for R?TKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor 
dirnerizaiion, resulting in transphosphorylation of the receptor subunits and activation 
of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include 
receptor associated tyrosine kinases of the sre-farnily (e.g., sre, yes. Ick, lyn. fyn) and 
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, 
members of which mediate signal transduction triggered by the cytokine superfamily 
of receptors (e.g., the Intcrlcukins, Interferons. GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, the identification of novel human secreted proteins capable of 
activating tyrosine kinase signal transduction pathways are of interest. Therefore, the 
following protocol is designed to identify those novel human secreted proteins 
capable of activating the tyrosine kinase signal transduction pathways. 
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Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
10 Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 

100% ethanol. rinsed with water and dried overnight. Some plates arc coated for 2 hr 
5 with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or 
polylysine (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. 
Louis, MO) or 10% Matrigel purchased from Becton Dickiason (Bedford,MA), or 
calf serum, rinsed with PBS and stored at 4°C. Cell growth on these plates is assayed 
by seeding 5,000 cells/^vell in growth medium and indirect quantitation of cell 
20 10 number through use of alamarBluc as described by the manufacturer Alamar 

Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers #3071 from 
Becton Dickinson (Bedford,MA) are used to cover the Loprodyne Silent Screen 
Plates. Falcon Microtest III cell culture plates can also be used in some proliferation 

25 

expenments. 

15 To prepare extracts, A43 1 cells are seeded onto the nylon membranes of 

Loprodyne plates (20.000/200ml/well) and cultured overnight in complete medium. 
30 Ce,ls arc quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 

minutes treatment with EOF (60ng/mi) or 50 ul of the supernatant produced in 
Example 11, die medium was removed and 100 ml of extraction buffer ((20 mM 
20 HEPES pH 7.5, 0. 15 M NaCl, 1% Triton X-100, 0. 1% SDS, 2 mM Na3V04, 2 mM 
Na4P207 and a cocktail of protease inhibitors (# 18361 70) obtained from 
Boeheringer Mannheim (Indianapolis, IN) is added to each well and the plate is 
shaken on a rotating shaker for 5 minutes at 40C The plate is then placed in a 
vacuum transfer manifold and the extract filtered through die 0.45 mm membrane 
25 bottoms of each well using house vacuum. Extracts are collected in a 96-weli 

catch/assay plate in the bottom of the vacuum manifold and immediately placed on 
ice. To obtain extracts clarified by centrifugation, the content of each well after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 
4°Cat 16,000 xg. 
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Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods or detecting tyrosine kinase activity are known, one method is described 
here. * 

Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylatcd peptides that can be used for this purpose include 
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in 
order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg 2+ (5mM * 
ATP/50mM MgCi2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM bcta-glycerophosphate, ImM EGTA, lOOmM MgCl 2 , 5 mMMnCl2 
0.5 mg/ml BS A), then 5ul of Sodium Vanadate( I mM), and then 5ul of water. Mix the 
components gently and preincubate the reaction mix at 30°C for 2 min. Initial the 
reaction by adding IOul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 
120mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 nun. This 
allows the streptavadin coated 96 well plate to associate with die biotinylated peptide. 
Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/nU)) to each well and incubate at 37°C for one hour. Wash the well as 
above. 

Next add lOOui of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELISA reader. The level of bound 
peroxidase activity is quantitated using an ELISA reader and reflects the level of 
tyrosine kinase activity. 
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Example 20: High-Throughput Scre e ning Assay ydentifving Phosphoryl ation 
Activity 

10 ^ a potential alternative and/or compliment to the assay of protein tyrosine 

kinase activity described in Example 19, an assay which detects activation 
5 (phosphorylation) of major intracellular signal transduction intermediates can also be 
used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-l and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
10 phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
substituting these molecules for Erk-l or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well EUSA 
plate with 0. Irnl of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are 
25 then rinsed w "h PBS and blocked with 3% BSA/PBS for I hr at RT. The protein G 

15 plates are then treated with 2 commercial monoclonal antibodies (lOOng/well) against 
Erk-l 

and Erk-2 ( 1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of 
the above described molecules.) After 3-5 rinses with PBS, the plates are stored at 
20 4°C until use. 

35 A43 1 ceils are seeded at 20,000/weJl in a 96-well Loprodyne filterplate and 

cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supernatants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 
25 filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As 
a positive control, a commercial preparation of MAP kinase (lOng/well) is used in 
place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
30 antibody ( lug/ml) which specifically recognizes the phosphorylated epitope of the 
Erk-l and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard 
procedures. The bound polyclonal antibody is then quantitated by successive 
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incubations with Europium-streptavidin and Europium fluorescence enhancing 
reagent in the Wadac DELFIA instrument (time-resolved fluorescence). An increased 
fluorescent signal over background indicates a phosphorylation. 

Ex ample Z\; Method of Determining Alterations in a Gene Corresponding t n a 
Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenorypc of interest (such as a disease) is be isolated. cDNA is then generated from 
these RNA samples usi^g protocols known in the art. (See, Sambrook.) The cDNA 
is then used as a template for PCR, employing primers surrounding regions of interest 
in SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30 ' 
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer 
solutions described in Sidransky, D., et aL, Science 252:706 (1991). 

PCR products are then sequenced using primers labeled at their 5* end with T4 
polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exons is also determined and 
genomic PCR products analyzed to confirm the results. PCR products harboring 
Suspected mutations is then cloned and sequenced to validate the results of the direct 
sequencing. 

PCR products is cloned into T-tailed vectors as described in Holton, T.A. and 
Graham, M.W., Nucleic Acids Research, 19: 1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals are identified by 
mutations not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5'- 
triphosphate (Boeliringer Manheim), and FISH performed as described in Johnson, 
Cg. et aL, Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe 
is carried out using a vast excess of human cot- 1 DNA for specific hybridization to 
the corresponding genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenylidoIe and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
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precise mapping arc obtained using a triple-band filter set (Chroma Technology, 
Brattleboro, VT) in combination with a cooled charge-coupled device camera 
(Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 
et al.. Genet Anal. Tech. AppL, 8:75 (1991).) Image collection, analysis and 
chromosomal fractional length measurements are performed using the Kee Graphical 
Program System. (Inovision Corporation, Durham, NC) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
translocations. These alterations are used as a diagnostic marker for an associated 
disease. 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected; this polypeptide 
is a marker for a particular phenotype. Methods of detection are numerous,. and thus, 
it is understood that one skilled in the art can modify the following assay to fit their 
particular needs. 

For example, antibody-sandwich ELIS As are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microtiter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are 
either monoclonal or polyclonal and are produced by the method described in 
Example 10. The wells are blocked so that non-specific binding of the polypeptide to 
the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used 
to validate results. The plates are then washed three times with deionized or distilled 
water to remove unbounded polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng f is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 
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Add 75 ul of 4-methylumbelliferyl phosphate (MUP) ur p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the reaction by a microtiter plate reader Prepare a standard 
curve, using serial dilutions of a control sample, and plot polypeptide concentration 
on the X-axis (log scale) and fluorescence or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard 



Ex ample 23; Formulating a Polypeptide 

0 The secreted polypeptide composition will be formulated and dosed in a 

fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment with the 
secreted polypeptide alone), the site of delivery, the method of administration, the 
scheduling of administration, and other factors known to practitioners. The "effective 
5 amount" for purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
secreted polypeptide administered parenterally per dose will be in the range of about 1 
Jig/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will 
be subject to therapeutic discretion. More preferably, this dose is at least 0.01 
mg/kg/day, and most preferably for humans between about 0.01 and I mg/kg/day for 
the hormone. If given continuously, the secreted polypeptide is typically 
adnunistered at a dose rate of about 1 u.g/kg/hour to about 50 ug/kg/hour, either by 1- 
4 injections per day or by continuous subcutaneous infusions, for example, using a 
mini-pump. An intravenous bag solution may also be employed. The length of 
treatment needed to observe changes and the interval following treatment for 
responses to occur appears to vary depending on the desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention 
are administered orally, rectally, parenterally, intracistemally, intravaginally. 
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
patch), hucally. or as an oral or nasal spray. "PharmaceuUcally acceptable carrier- 
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to 
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modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrasternal, subcutaneous and intraarticular injection and infusion. 

The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained- release matrices include polylactides (U.S. Pat. No. 
3,773,919, EP 58,45 1), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate 
(Sidman, U. et al., Biopolyuiers 22:547-556 (1983)), poly (2- hydroxyethyl 
methacryiate) (R. Lange^et al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. 
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et al.) or 
poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained-release compositions' 
also include liposomally entrapped polypeptides. Liposomes containing the secreted 
polypeptide are prepared by methods known per se: DE 3,218,121; Epstein et al., 
Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et al., Proc. Nad. Acad. Sci. 
USA 77:4030-4034 ( 1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and 
EP 102,324. Ordinarily, the liposomes are of the small (about 200-800 Angstroms) 
uni!amei:ar type in which the lipid content is greater than about 30 mol. percent 
cholesterol, the selected proportion being adjusted for the optimal secreted 
polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 
formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically 
acceptable carrier, i.e. t one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the formulation. 
For example, the formulation preferably does not include oxidizing agents and other 
compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably the 
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 
of the recipient. Examples of such carrier vehicles include water, saline, Rineer's 
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solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. 
10 ^ carrier suitably contains minor amounts of additives such as substances 

that enhance isotonicity and chemical stability. Such materials are non-toxic to 
5 recipients at the dosages and concentrations employed, and include buffers such as 
phosphate, citrate, succinate, acetic acid, and other organic acids or their salts; 
antioxidants such as ascorbic acid; low molecular weight (less than about ten 
residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
20 10 Polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or 

arginine ; monosaccharides, disaccharides, and other carbohydrates including cellulose 
or its derivatives, glucose, manose. or dextrins; chelating agents such as EDTA; sugar 
alcohols such as rnannitol or sorbitol; counterions such as sodium; and/or nonionic 
25 surfactants such as polysorbates. poloxamers, or PEG. 

15 The secreted polypeptide is typically formulated in such vehicles at a 

concentration of about 0. 1 mg/ml to 100 mg/ml, preferably 1 -10 nig/ml, at a pH of 
about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes 
35 (eg., 0.2 micron membranes). Therapeutic polypeptide compositions generally are 

placed into a container having a sterile access port, for example, an intravenous 
solution bag or vial having a stopper pierceable by a hypodermic injection needle. 
Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
25 example, sealed ampoules or vials, as an aqueous solution or as a lyophilizcd 

formulation for reconstitution. As an example of a lyophilized formulation, 10-ml 
vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, 
45 md the resuJtin S mixture is lyophilized. The infusion solution is prepared by 

reconstituting the lyophilized polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such containers) can be a notice in 
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the form prescribed by a governmental agency regulating the manufacture, use or sale 
of pharmaceuticals or biological products, which notice reflects approval by the 
agency of manufacture, use or sale for human administration. In addition, the 
polypeptides of the present invention may be employed in conjunction with other 
therapeutic compounds. 

Example 24: Method of Treating De creased Levels of the Pnlvp epfir^ 

It will be appreciated that conditions caused by a decrease in die standard or 
normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted 
form. Thus, the invention also provides a method of treatment of an individual in ' 
need of an increased level of the polypeptide comprising administering to such an 
individual a pharmaceutical composition comprising an amount of the polypeptide to 
increase the activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily 
dose 0. 1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
ad-r&straticr. ar.d formulation, are provided in Example 23. 

Example 25: Methnd of Treating Increased levels of the Polype ptide 

Antisense technology is used lo inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as 
cancer. 

For example, a patient diagnosed with abnormally increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 
2.0 and 3.0 mg/kg day for 2 1 days. This treatment is repeated after a 7-day rest 
period if the treatment was well tolerated. The formulation of the antisense 
polynucleotide is provided in Example 23. 

Example 26 : Method of Treatment IJsin g Gene Therapy 
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One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
10 subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 

separated into small pieces. Small chunks of the tissue are placed on a wet surface of 
5 a tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g.. Ham's Fl 2 media, with 10% FBS, 
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for 
2Q 10 approximately one week. 

At this time, fresh media is added and subsequently changed every several 
days. After an additional two weeks in culture, a monolayer of fibroblasts emerge. 
The monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier. P.T. et al., DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
HindHI and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5* and 3* end sequences respectively as set 
forth in Example 1 . Preferably, the 5' primer contains an EcoRI site and the 3' primer 
35 includes a Hindin site. Equal quantities of the Moloney murine sarcoma virus linear 

backbone and the amplified EcoRI and HindUI fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
25 transform bacteria HB 101 , which arc then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 
The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue 
45 culture to confluent density in Duibecco's Modified Eagles Medium (DMEM) with 

10% calf serum (CS) t penicillin and streptomycin. The MSV vector containing the 
30 gene is then added to the media and the packaging cells transduced with the vector. 
The packaging cells now produce infectious viral particles containing the gene (the 
50 packaging ceils are now referred to as producer cells). 
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Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a milliporc filter to remove 
detached producer ceils and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the 
media from the producer cells. This media is removed and replaced with fresh media. 
If the titer of virus is high, then virtually all fibroblasts will be infected and no 
selection is required. If the titer is very low, then it is necessary to use a retroviral 
vector that has a selectable marker, such as neo or his. Once the fibroblasts have been 
efficiently infected, the fibroblasts are analyzed to determine whether protein is - 
produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 
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Example 27: Method of Treatment Using Gene Thcranv - In Vivn 

Another aspect of the present invention is using in vivo gene therapy methods 
to treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) 
sequences into an animal to increase or decrease the expression of the polypeptide. 
The polynucleotide of the present invention may be operatively linked to a promoter 
or any other genetic elements necessary for the expression of the polypeptide by the 
target tissue. Such gene therapy and delivery techniques and methods are known in 
the art, see, for example, WO90/I1092, W098/1 1779; U.S. Patent NO. 5693622, 
5705151, 5580859; Tabata H. etal. (1997) Cardiovasc. Res. 35(3):470-479, Chao Jet 
al. (1997) Pharmacol. Res. 35(6):5 17-522, Wolff J.A. (1997) Neuromuscul. Disord. 
7(5):3I4-318, Schwartz B. et al. (1996) Gene Ther. 3(5):405-4I 1, Tsurumi Y. et al. 
(1996) Circulation 94(12):3281-3290 (incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method thai delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial 
space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The 
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polynucleotide constructs can be delivered in a pharmaceutical ly acceptable liquid or 
aqueous carrier. 

10 The term "naked 11 polynucleotide, DNA or RNA, refers to sequences that are 

free from any delivery vehicle that acts to assist promote, or facilitate entry into the 
5 cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the present 
invention may also be delivered in liposome formulations (such as those taught in 
Feigner P L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah B. et a!. 
(1995) Biol. Cell 85(1): 1-7) which can be prepared by methods well known to those 
10 skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow tor replication. Any strong promoter known to those skilled in 
the art can be used for driving the expression of DNA. Unlike other gene therapies 
techniques, one major advantage of introducing naked nucleic acid sequences into 
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies 
have shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of 
20 tissues within the an animal, including of muscle, skin, brain, lung, liver, spleen, bone 
35 marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 

stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and 
connective tissue. Interstitial space of the tissues comprises the intercellular fluid, 
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers 
25 in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same 
matrix within connective tissue ensheathing muscle cells or in the lacunae of bone. It 
is similarly the space occupied by the plasma of the circulation and the lymph fluid of 
45 thc Emphatic channels. Delivery to the interstitial space of muscle tissue is preferred 

for the reasons discussed below. They may be conveniently delivered by injection 
into the tissues comprising these cells. They are preferably delivered to and 
expressed in persistent, non-dividing cells which are differentiated, although delivery 
and expression may be achieved in non-differentiated or less completely 
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differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In 
vivo muscle cells are particularly competent in their ability to take up and express 
polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA win be in the range of from about 0,05 g/kg body weight to about 50 mg/kg 
body weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 
mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as 
the artisan of ordinary skill will appreciate, this dosage will vary according to the 
tissue site of injection. The appropriate and effective dosage of nucleic acid sequence 
can readily be determined by those of ordinary skill in the art and may depend on the 
condition being treated and the route of administration. The preferred routtf of 
administration is by the parenteral route of injection into the interstitial space of 
tissues. However, other parenteral routes may also be used, such as, inhalation of an 
aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or 
mucous membranes of the nose. In addition, naked polynucleotide constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

The dose response effects of injected polynucleotide in muscle in vivo is 
determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA : which may be either circular or 
linear, is either used as naked DNA or complexed with liposomes. The quadriceps 
muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1 .5 cm incision is made on 
the anterior thigh, and the quadriceps muscle is directly visualized. The template 
DNA is injected in 0. 1 ml of carrier in a 1 cc syringe through a 27 gauge needle over 
one minute, approximately 0.5 cm from the distal insertion site of the muscle into the 
knee and about 0.2 cm deep. A suture is placed over the injection site for future 
localization, and the skin is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 15 urn cross-section of the 
individual quadriceps muscles is histochemically stained for protein expression. A 
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time course for protein .expression may be done in a similar fashion except that 
quadriceps from different mice are harvested at different times. Persistence of DNA 
in muscle following injection may be determined by Southern blot analysis after 
preparing total cellular DNA and HIRT supematants from injected and control mice. 
The results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked 
DNA. 

Exampl e 28: Transgenic Animals. 

The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, 
guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a 
specific embodiment, techniques described herein or otherwise known in the art, are 
used to express polypeptides of the invention in humans, as part of a gene therapy 
protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not limited to, pronuclear 
microinjection (Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); 
Carver et al., Biotechnology (NY) 1 1:1263-1270 (1993); Wright ctal., Biotechnology 
(NY) 9:830-834 (1991); and Hoppc et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus 
mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci., 
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic 
stem cells (Thompson et al., Cell 56:313-321 (1989)); clcctroporation of cells or 
embryos (Lo, 1983, Mol Cell. Biol. 3:1803-1814 (1983)); introduction of the 
polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science 
259:1745 (1993); introducing nucleic acid constructs into embryonic pleuripotcnt 
stem cells and transferring the stem cells back into the blastocyst; and sperm- 
mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989); etc. For a review of 
such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol. 1 15:171-229 
(1989), which is incorporated by reference herein in its entirety. 
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Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into 
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to 
quiescence (Campell et ah, Nature 380:64-66 (1996); Wilmut et al., Nature 385:810- 
813(1997)). 

The present invention provides for transgenic animals that carry the transgene 
in all their cells, as well as animals which carry the transgene in some, but not all their 
cells, i.e., mosaic animals or chimeric. The transgene may be integrated as a single 
transgene or as multicopies such as in concatamers, e.g., head-to-head tandems or 
head-to-tail tandems. The transgene may also be selectively introduced into .and 
activated in a particular cell rype by following, for example, the teaching of Lasto et 
al. (Lasko et al., Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory 
sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When 
it is desired that the polynucleotide transgene be integrated into the chromosomal site 
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is 
to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene may also be selectively 
introduced into a particular cell type, thus inactivating the endogenous gene in only 
that cell type, by following, for example, the teaching of Gu et al. (Gu et aU Science 
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of interest, and will be apparent 
to those of skill in the art 

Once transgenic animals have been generated, the expression of the 
recombinant gene may be assayed utilizing standard techniques. Initial screening 
may be accomplished by Southern blot analysis or PCR techniques to analyze animal 
tissues to verify that integration of the transgene has taken place. The level of mRNA 
expression of the transgene in the tissues of the transgenic animals may also be 
assessed using techniques which include, but are not limited to, Northern blot analysis 
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 
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transcriplase-PCR (rt-PCR). Samples of transgenic gene-expressing tissue may also 
be evaluated immunocytochemically or immunohistochemically using antibodies 
specific for the trans gene product 

Once the founder animals are produced, they may be bred, inbred, outbrcd, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more 
than one integration site in order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics that express the transgene at higher 
levels because of the effects of additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals homozygous for a given 
integration site in order to both augment expression and eliminate the need for 
screening of animals by DNA analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; and breeding to place the 
transgene on a distinct background that is appropriate for an experimental mode! of 
interest. 

Transgenic animals of the invention have uses which include, but are not 
limited to ; animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders associated 
with aberrant expression, and in screening for compounds effective in ameliorating 
such conditions and/or disorders. 

Example 29: Knock-Out Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking 
out" the gene and/or its promoter using targeted homologous recombination. (E.g., 
see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503- 
512 (1987); Thompson et al. t Cell 5:313-321 (1989); each of which is incorporated by 
reference herein in its entirety). For example, a mutant, non- functional 
polynucleotide of the invention (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous polynucleotide sequence (either the coding 
regions or regulatory regions of the gene) can be used, with or without a selectable 
marker and/or a negative selectable marker, to transfect cells that express 
polypeptides of the invention in vivo. In another embodiment, techniques known in 
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the art are used to generate knockouts in cells that contain, but do not express the gene 
of interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 
research and agricultural fields where modifications to embryonic stem cells can be 
used to generate animal offspring with an inactive targeted gene {e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). However this approach can be routinely 
adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that 

will be apparent to those of skill in the art. 

f 

In further embodiments of the invention, cells that are genetically engineered 
to express the polypeptides of the invention, or alternatively, that are genetically 
engineered not to express the polypeptides of the invention (e.g., knockouts) are 
administered to a patient in vivo. Such cells may be obtained from the patient (i.e., 
animal, including human) or an MHC compatible donor and can include, but are not 
limited to fibroblasts, bone marrow cells, blood cells ( e.g. , lymphocytes), adipocytes, 
muscle cells, endothelial cells etc. The cells are genetically engineered in vitro using 
recombinant DNA techniques to introduce the coding sequence of polypeptides of the 
invention into the cells, or alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the polypeptides of the invention, 
e,g., by transduction (using viral vectors, and preferably vectors that integrate the 
transgene into the cell genome) or transfection procedures, including, but not limited 
to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc. 
The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 
expression, and preferably secretion, of the polypeptides of the invention. The 
engineered cells which express and preferably secrete the polypeptides of the 
invention can be introduced into the patient systemically, e.g., in the circulation, or 
intraperitoneally. 

Alternatively, the cells can be incorporated into a matrix and implanted in the 
body* genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. (See, for example, Anderson et al. U.S. Patent No. 5,399,349; and 
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Mulligan & Wilson, U.S. Patent No. 5,460,959 each of which is incorporated by 
reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC 
compatible cells, they can be administered using well known techniques which 
prevent the development of a host immune response against the introduced cells. For 
example, the cells may be introduced in an encapsulated form which, while allowing 
for an exchange of components with the immediate extracellular environment, does 
not allow (he introduced cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, 
but are not limited to, animal model systems useful in elaborating the biological 
function of polypeptides of the present invention, studying conditions and/or disorders 
associated with aberrant expression, and in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. Numerous 
modifications and variations of the present invention are possible in light of the above 
teachings and, therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples 
is hereby incorporated herein by reference. Further, the hard copy of the sequence 
listing submitted herewith and the corresponding computer readable form are both 
incorporated herein by reference in their entireties. 



WO 00/06698 



357 



PCT/LS99/17130 



Applicant's or item's file dt/viio/t JlnlernauonaJ application Ni 

reference number rZO Jl HLi I I 



U reassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCTRulc Mbis) 



A- The indications made below relate 10 the microorganism ref erred to in thedescription 
«n page 259 y ne N/A 



B. IDENTIFICATION OFDEPOSIT 



Further deposits arc identified on an additional sheet | [ 



Nomcofdcpojitary institution American Type Culture Collection 

r 



Address of depositary institution (including postal code and canary) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 




Accession Number 






July 27, 1998 




203070 



C ADDITIONAL INDICATIONS jitavt blank if not applicable) This information is continued on «i«dditio,«d sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE{if the mdywioas are not for all designated States} 



E. SEPARATE FURNISHING OF INDICATIONS Ueaveblank if not applicable) 

i^?W Ied bdOW Wi " 5Ubmi,tcd l ° ,hc Intcn,ational Burc * u later f Wfr rmnur ofifu tndicaacns e.^ 



J^j "This sheet was recei ved with the intematioti al application 


1 1 Th" sheet was recei ved by the International Bureau on: 


Authorized officer 

//Zsd rvette E-Slmms 

(// PCT ii^rrv^jwal Dwston 


Auihorizedofficcr 



Form PCT/RO/I34 (July 1992) 



WO 00/06698 



358 



PCT/US99/I7130 



Applicant's or agent's file 
reference number 



PZ031PCT 



International application Ni 



Unassigned 



10 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIe Ubis) 



50 



Date of deposit 

July 27. 1998 


Accession Number 

203069 


C ADDmONALWDlCATIONSflwwMani^^i^l, 


') This information is continued on an additional sheet | | 



A The indications mode below relate to the rnicrooreaiusn referred to in thedescriplion 



on page 



a IDENTIFICATION OF DEPOSIT 



Ntmo of depositary institution American Type Culture Coflection 



Funhcr deposits are identificdon an additional sheet | | 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



IX DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE/,/^«rf^non, a „„ w /ar ff /W«^S/« 



SEPARATE FURNISHING OFlNDICATIONSffcowManJt^ 



not applicable) 



'JSi^O^r faCl ° W ^ SUtm,ittCd ,Q thC !ntemational Bureau ] ™ <^^S™ral B amreofcht l ndxa**u, g : 



Forreceiving Office use only 



This sheet was received with the i 



international application 



Authorized ofrii 




Yvette E. Simms 

PCT latern'anAnai Piv'StP.i 



BPCT/R0/I34(luly 1992) 



For International Bureau use only 



I 1 This sheet was received by the International Bureau o 



Authorized officer 



WO 00/06698 



359 



PC17US99/17130 



Applicant's or agent's file 
reference number 



P2031 PCT 



IntemanonaJ application Nt. 



Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule 136u) 

A. Tlic indications made below relate lothe microorganism referred to in the description 

on page 253 Boe N/A 

B. IDENTIFICATION OF DEPOSIT f^^^i^m^ ^u^l^ Q 

Name of depository imtuuoon American Typo Culture Collection 

£ ' 

Address of depositary institution (indudmg postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit 




Accession Number 








July 27. 1998 




203071 





C ADDITIONAL ^ICATIONS/WM^^qp/^^ This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE M^Omthei^catw^notforaUdenp^States} 



E. SEPARATEFURNISHING OF INDIC ATlONSfleaveblankifnat^tpplicabte) 



N^'^fr^r* Wi " SUbttUtICd 10 thc Intc ™ 1 ^' Bureau later (specify the ^ndtmsn of the indication, eg, *Acc*sm 



For recei ving Office use onl y 



This sheet was received with thc international application 



Aiiihorizedofjgc£t4 

frr~ YvettetSimms 

(/ PCT tafcnvtftona! QMslo* 



For International Bureau use only 



□ 

This sheet wa 5 received by the International Burcauon: 



Authorized officer 



Form PCT/RO/134 (July 1992) 



WO 00/06698 



360 



PCT/US99/I7130 



Applicants or agent's file 
reference number 



PZD31 PCT 



International application Nc 



Unasstgrted 



10 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule mis) 



15 



20 



25 



30 



35 



40 



45 



50 



A. The indications made below relate to die microorganisratefcrredtointhedescripuon 



on page 



.line 



& IDENTIFICATION OF DEPOSIT 



Further deposits arc identified on an tidditionai sheet [ | 



Nome of depositary institution American Type Culture Collection 



Address of depositary institution (iiuJuding postal code and country} 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Dateot'dcposh 



June 26. 1998 



Accession Number 



203027 



C ADDITIONAL. MDlCATlOt&faavtbknkif not applicable) This information is continued on an additional jhcet Q 



a DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the iadkaiiaw arenotforalidesignaudStates) 



E, SEPARATE FURNISHING OF WDIC ATlOSS(Uavebtankifaoiapplicablcl 



N^^D^r™ bCtoW facaub '" iUcU tU thc ' ntcroatiQnal Bmx later Up&togmmtmmitfto&i aiig*,*. 'Accession 



ForrcceivingOfficcuseonly 



This sheet was received wich theintemationa) application 



Yvette E. Storms 

PCT U^m^^ial OMstoA 



i PCT/RO/134 {July 1992) 



Furlruemauonal Bureau use only 



I I This sheet was recci ved by ihc International Bureau on: 



Authorized officer 



55 



WO 00/06698 



361 



PO7US99/17130 



Applicant s or agent's file 
reference number 



PZ031PCT 



International application N> 



Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule I3Wj) 



A TheindicationsmittiebelowrelH^ 

on page 243 Iine N/A 



a IDENTlFICATIONOFDEPQSrr 



Furthtrdeposits arc identified on an additional sheet [ | 



Name of depositary institution American Type Culture Collection 
___ r 



Address of depositary institution (inctuavig postal code end country) 

10801 University Boulevard 
Manassas. Virginia 20110-2209 
United Stales of America 



Dateof deposit 



June 11. 1998 



Accession Number 



209965 



C ADDITIONAL INDICATIONS^* bhmk if not applicable) This information is continued on on additional shew [J 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MM>mthe Canons art mt^aUdcsitaciidSta^y 



E. SEPARATEFURNISHING OFINDIC^TIONSftowMuik^TO/iVpteaWe) 



The indications 
Number of Deposit") 



lifted below will b* submitted to the International Bureau later (iptcify che gencml nam ufthe indxanaxs ^ * 



For receiving Onice use only 



This sheet was received with the international application 



hori^^r 



Yvette E. Simms 
PCT !nSs?maiNiR( 



Form PCT/RO/134 (July 1992) 



For International Bureau useonly 



I I This sheet was received by the International Bureau on; 



Authorized officer 



Claims 



5 



10 



15 



20 



25 



30 



35 



40 



45 



55 



WO 00/rt6698 



362 



PCT/US99/17130 



What Is Claimed Is: 



1 . An isolated nucleic acid molecule comprising a polynucleotide having 
a nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment ofSEQ ID NO:X or a polynucleotide fragment 
of the cDNA sequence included in ATCC Deposit No:Z. which is hybridizabie to 
SEQIDNOX; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizabie to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizabie to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit 
No:Z r which is hybridizabie to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizabie to SEQ ID NO:X t 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID 

NO:Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

2. The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted 
protein. 
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3. The isolated nucleic acid molecule of claim I, wherein the 
10 polynucleotide fragment comprises a nucleotide sequence encoding the sequence 

identified as SEQ ID NO: Y or the polypeptide encoded by the cDMA sequence 
included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 



15 



20 



30 



4. The isolated nucleic acid molecule of claim 1 . wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X 
or the cDN A sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X. 



5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
25 N-terminus. 



6, The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-ierminus. 



7. A recombinant vector comprising the isolated nucleic acid molecule of 
35 claim 1. 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1 . 

40 

9. A recombinant host cell produced by the method of claim 8. 

45 ,0 - The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 
95% identical to a sequence selected from the group consisting of: 

50 
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(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence 
included in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence 
included in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(0 a full length protein of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO:Y; or 

(i) a species homologue of the SEQ ID NO: Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from cither the C- 
terminus or the N -terminus. 

13. An isolated antibody that binds specifically to the isolated polypeptide 
of claim II. 

14. A recombinant host cell that expresses the isolated polypeptide of 
claim 11. 

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 

(b) recovering said polypeptide. 



16. 



The polypeptide produced by claim 15. 
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1 7. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount 
of the polypeptide of claim 1 1 or the polynucleotide of claim I. 

18. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or amoum of expression of the polypeptide of 
claim 11 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 
1 1 comprising: 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 



21. The gene corresponding to the cDNA sequence of SEQ ID NO: Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 
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23. The product produced by the method of claim 20. 
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<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 



<400> 1 

SFSrgatccgga 

aattcgaggg 

tctcccggac 

tcaagttcaa 

aggagcagta 

ggctgaatgg 

agaaaaccat 

catcccggga 

atccaagcga 

ccacgcctcc 

acaagagcag 

acaaccacta 

gactctagag 



gcccaaa~ct 
tgcaccgtca 
tcctgaggtc 
ctggtacgtg 
caacagcacg 
caaggagtac 
ctccaaagcc 
tgagctgacc 
catcgccgtg 
cgtgctggac 
gtggcagcag 
cacgcagaag 
gat 



tctgacaaaa 
gtcttcctct 
acatgcgtgg 
gacggcgngg 
taccgtgtgg 
aagtgcaagg 
aaagggcagc 
aagaaccagg 
gagtgggaga 
tccgacggct 
gggaacgtct 
agcctctccc 



ctcacacatg 
tccccccaaa 
tggtggacgt 
aggtgcataa 
tcagcgtcct 
tctccaacaa 
cccgagaacc 
tcagcctgac 
gcaatgggca 
cctr.cttcct 
tctcatgctc 
Ugtctccggg 



cccaccgtgc 
acccaaggac 
aagccacgaa 
tgccaagaca 
caccgtcctg 
agccctccca 
acaggtgtac 
ctgcctggtc 
gccggagaac 
ctacagcaag 
cgtgatgcat 
taaacgagtg 



ccagcacccg 
accctcatga 
gaccctgagg 
aagccgcggg 
caccaggact 
acccccatcg 
accctgcccc 
aaaggcrtct 
aactacaaga 
ctcaccgtgg 
gaggctctgc 
cgacggccgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
733 



<210> 2 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<220> 
<221> Site 
<222> (3) 
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<223> Xaa equals any of the twenty naturally ocurring L-amir.o acids 
<4O0> 2 

Trp Ser Xaa Trp Ser ' 
1 5 



<210> 3 

<21i> 86 

<212> DNA. 

<213> Homo sapiens 



<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aatztag gg 



<210> 4 
<211> 27 
<212> DNA 

<213> Homo sapiens 
<400> 4 

gcggcaagct ttttgcaaag cctaggc 



<210> 5 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<40O> 5 

ctcgagattt ccccgaaatc fcagatttccc cgaaatgatt tccccgaaat gatttccccg 60 

aaatatctgc catctcaatt agtcagcaac catagtcccg cccccaactc cgcccatccc 120 

gcccctaact ccgcccagtt ccgcccattc tccgccccac ggctgactaa ttttttttat 180 

ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 

ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 6 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 32 

<210> 7 

<211> 31 

<212> DNA 

<213> Komo sapiens 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 



<210> 3 
<211> 12 
<212> DNA 
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3 

<213> Homo sapiens 

<400> 8 
ggggactttc cc 



<210> 9 
<211> 73 
<212> DNA 

<213> Homo sapiens 
<400> 9 

gcggcctcga ggggaccttc ccggggactt tccggggact ttccgggact tcccatcctg 
ccatctcaac tag 



<210> 10 
<211> 256 
<212> DNA 

<213> Homo sapiens 
<400> io 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggaccttcca tctgccatct 
caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 
cagttccgcc cattctccgc cccatggctc actaattttt tttatttatg cagaggccga 
ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 
cttt-gcaaa aagctt 



<210> 11 

<211> 1564 

<212> DMA 

<213> Homo sapiens 



atgtcaaatg gggaaccctg gtacaagata ttttcaccta tgctaaagta ttggcactga 180 

tcgcggtcat cgttgcaggc attgttagac ttggccaggg agcctctact cattttgaga 240 

attcctntga gggttcatca tttgcagtgg gtgacattgc cctggcactg tactcagctc 300 

tgttctccta ctcaggctgg gacaccctca actatgtcac tgaagagatc aagaatcctg 360 

agaggaacct gcccctctcc attggcatct ccatgcccat tgtcaccatc atctatatci 420 

tgaccaatgt ggcctattat actgtgctag acatgagaga catcttggcc agtgatgctg 480 

ttgctgtgac ttttgcagat cagatatttg gaatatttaa ctggataatt ccactgccag 540 

Ctgcattatc ctgttttggt ggcctcaatg cctccattgt ggctgcttct aggcttttct 600 

ctgtgggctc aagagaaggc oatcfcccctg atgccacctg catgatccac gttgagcggt 660 

tcacaccagt gccttctctg ctcttcaatg gtatcatggc attgatctac tcgcgcgtgg 720 

aagacatct.t ccagctcatt aactactaca gcttcagcta ccggttcttt gtggggcttt 7B0 

ctattgtggg tcagctttat ctgcgctgga aggagcctga tcgacctcgt cccctcaagc 840 

tcagcgtttt cttcccgatt gtcttctgcc tctgcaccat cttcctggtg gctgttccac 900 

tttacagtga tactatcaac tccctcatcg gcattgccat tgccctctca ggcctgccct 950 

tttacttcct catcatcoga gtgccagaac ataagcgacc gctttacctc cgaagatcgt 1020 

ggggtctgcc acaaggtacc tccaggtcct gtgtatgtca gttgctgcag aaatggattt 1080 

ggaagatgga ggagagacgc ccaagcaacg ggatcccaag tctaactaaa caccacctgg 1140 

aatcctgatg tggaaagcag gggtttctgg tctaccggct agagcraagg aagttgaaaa 120O 

ggaaagctca cttctttgga ggcacctgtc cagaagccug gcctaggcag cttcaacctt 1260 

tgaacttact ttttgaaatg aaaagtaatc tatttgtttt. gctacatact gttccagact 1320 

tttaaagggg acaatgaagg tgactgtggg gaggagcatg tcaggtctgg gcttggttgt 1380 

Cttagaagca cctgggtgtg cctacctact cctcttttct tttaaaaggg cccacaatgc 1440 
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tccaatttcc tgtctccttt agagagacat gaaactatca caggf.gctgg atgacaataa 
aagtttatgt tcctaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc 
cccg 



1500 
1560 
1564 



<210> 12 
<2H> 1757 
<212> DNA 

<213> Komo sapiens 



<400> 12 

ggcacgaggt 

acagacagga 

gcccctgctg 

ttgaggaggc 

tcttcaaagc 

tgtctgtgcc 

ggtgggctcc 

atggcgggcg 

gctagcaacc 

ttgtggtgcg 

gattttcccg 

ccctgacccc 

gtgcgtgcac 

acctcagatg 

accagagctg 

tttatctctt 

ggagcactta 

gcaactctat 

agtaactaat 

tctgactgac 

tagaaagtgg 

gaagagttat 

tactccagag 

atcggcttcc 

ttatctactg 

tttataaaag 

ggggataatt 

gggtgcagtg 

cctgaaggcc 

aaaaaaaaaa 



agatgggttt 
ccctcagctg 
gagggtgcct 
agtctgcccg 
tgtcagacag 
ctgcccccag 
acccagttcg 
cccctccccc 
agcgagactc 
ccatttttta 
attttccagg 
ttgcgcttcc 
ccactgacct 
gaaatgcaga 
ttcctattcc 
aacaaagtaa 
ctatgtgttt 
aacatattaa 
gagtaacaat 
attgcr.cf.t:g 
ttttcztttt 
tttctaggtc 
ggaaatgtgg 
acacacagta 
gattttcctt 
aaaagcatac 
tgaaatattg 
gctcacactt 
aagagttcaa 
aaaaaaa 



ttggtgtgga 
caggtctgtt 
cccagttagg 
ttcrcagatc 
ggacatttaa 
aggtggagcc 
agcttcccgg 
agtctcgctg 
cc/tggggtag 
agcccgtcgg 
tgccgtccgt 
cgagtgaggc 
gcgcccactg 
aatcacccgt 
gcca tcttgg 
atactgcaca 
tacggtccaa 
tagattgtct 
accagcccct 
t tcagacaca 
caagttgttc 
ttccagaagg 
gcatagaaac 
tattatttct 
tcccagcccc 
ctcccaccta 
ctugagacac 
gtaatcccag 
gaccagcctg 



tgtcctttct 

ggagtaccct 

ctgctcgcgg 

cccagctgca 

gtctrgcagag 

tacagaggca 

ctgccttgct 

ccgcctcgca 

gaccctctga 

aaaagcgcag 

catccctttc 

aatgccttgc 

tctggcactc 

cttctgcgtt 

ctcctccccc 

cgtatttaca 

gtactctata 

caatttcact 

aaaacctagg 

gcactcttca 

ccattcatca 

ggctgc-gac 

tttttaaaaa 

taaaaatcag 

attcttcttt 

tagcrtccct 

aattctcatt 

aactttggga 

gaatatatac 



gtttgttagt 

gcaatgtgag 

gtcaggggtc 

tgctgggaga 

gttactgctg 

ggcaggcctc 

tacctaagca 

gtttgatctc 

gccaggtggg 

tattcaggtg 

tttgattagg 

gctgctttgg 

cctagtgaga 

gctcaggctg 

accccccgcc 

tgttttaaaa 

tcatzcactg 

gatgacaaaa 

gtctttccac 

cctagaaaag 

atattctatc 

acttgggaat 

ctgaattttc 

ccagaaatgc 

aagtagtttt 

ccttatgg ta 

aagaaataaa 

ggccaaggcg 

caagatcccc 



tgtccttcca 
gtgtcagtgt 
agggacccac 
acsactgctc 
tctttttgtt 
ctcgagctgt 
agcctgggca 
agactgctgt 
ggatataatc 
ggagtgaccc 
aaagggaact 
ctcgtgcatg 
tgaacccggt 
ggagctgtag 
caatcctaag 
tattgtttat 
actctttata 
ctggcagctt 
tacagcacac 
gatggagagc 
aagatatatg 
tcaaaataat 
ccgatctaat 
acctgcagtt 
ttttccccat 
cattatctca 
ctttcaggct 
ggggaggatc 
tctccacaaa 



60 
120 
180 
240 
300 
360 
420 
*480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1630 
1740 
1757 



<:210> 13 

<211> 1373 

<212> DNA 

<213> Homo sapiens 



<400> 13 

gtgaaacgaa 

tggatctgga 

attccctgcc 

gtatgttttg 

ctctgtttcc 

tccagcttta 

ataatcactc 

agcctgaaaa 

gaatgagagc 

tgtatattaa 



acatgtatac 
aaat-ccctt 
tttccatatt 
attgttaatt 
tgttctggaa 
trtatcttaa 
tagtcagtga 
ggatcagagg 
CLtgtattat 
tttacgcaaa 



caaaacacat 
tctgctgntc 
ttgaaatgtt 
ttattgacca 
tgttccctgc 
gtatttttac 
aatcaaactg 
aagcactttt 
tcagccttgt 
atacacacat 



aaatttaagt 
ttcartctga 
ttggtcaaat: 
tcatcccagt 
tc tggaag cc 
aattatttta 
acaacacaaa 
tagtctaagt 
gatagaagga 
ratcttccca 



tttataattt 
tagttgcttt 
tutctcctgt 
ggattattta 
atattttaac 
ctgataatat 
atgatgctgc 
agtcaagatc 
aaaagtacag 
ttaacacagt 



tctttccttg 
ggctttccct 
tgagtgtgca 
rttgaaactt 
tgtatactga 
ataactaggc 
tagzaaattt 
aataaactta 
tcactgaatc 
agtatgacat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



WO 00/06698 



5 



PCT/US99/17130 



acagttttat 
aaagtagctt 
tgagtttgaa 
tcatttttgt 
ttgtgactgg 
tcatttgctt 
tgtacgtata 
atgtgtgtat 
aatgtatttg 
tattctgtat 
aatagtattg 
tagggctcac 
gctattacat 



caccactgga 
cttatcttcc 
gaagttcagg 
cctaccacta 
ccctcagtta 
tttczcttgc 
tgcatatcca 
tccctacccc 
ttttaatgct 
ttgtaact-c 
attggaagga 
aagtgtataa 
taaaaaagct 



tfccatgtatg 
cagatacagc 
taaccctctc 
aaagcttgct 
gacctgcatc 
agtgtatttc 
tatggtcaag 
ttgttttgct 
ttctctttag 
ccgagaatct 
cagtgaagta 
gaagcatccc 
aztctcatta 



cttaaaatga 
ctccaagagt 
ctaaccagca 
tattgacctc 
tgttgctact 
tgaatgttTt 
agttttcaga 
ctgtcttaac 
attgtaagct 
agtattttat 
caaaatgaga 
tttcctattt 
aaaaaaaaaa 



aaatgaaagg 
gcctctgctg 
gctcaactta 
cttctttcct 
tgtagcagca 
ataactctgg 
attctttgtt 
atttaagctt 
cttcatgggc 
catgtacata 
ttctaaaatg 
ttttgtttaa 
aaaaaaaaaa 



aacacacagg 
tgtactggag 
aagtgcatct 
gtctcctaaa 
attgcaggat 
gtatgcataa 
tgaataatga 
tgcttattaa 
agtggccaga 
atgaacatag 
acattttcat 
tactttagaa 
aaa 



660 
720 
780 
840 
900 
960 
L020 
1080 
1140 
1200 
1260 
1320 
1373 



<210> 14 
<211> 3740 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (957) 
<223> n equals a,t,g, 



<400> 14 

actctctaag 

tcacttaagg 

ctacttctca 

tcatttatgt 

taagtccaat 

ccatgatgca 

cagtttcctt 

aactgcatat 

ctattgcact 

agtttatttt 

gataaaacaa 

aaacatcatt 

tgaagcaatc 

tcttccaaac 

tgctgaaaga 

gaagtacaag 

tatatgcaga 

cagtatttca 

tttcatattc 

gcagtactaa 

atctgttttt 

ttaaagaa&t 

caattcagtc 

attaaatctt 

gatgttttgc 

aaccag-ttt 

aatgccatca 

gtggagatgc 

tgctcttggt 

ggagaaggaa 

aaccaatctt 

tttgcaggtt 

tagctctctt 



gcttttttgg 

ttgactgtgc 

cagggcctgt 

ggagctgact 

gaaagatgct 

gacggccact 

aaagagctcc 

gcaaatggaa 

aagactggtg 

gaagcaaatg 

tcagcagaac 

gactfcgttta 

ttggctctga 

agacaactta 

caagcagtta 

ctttctcgag 

agcagactca 

gctggctgga 

ctgagaaact 

gagatttata 

cttaaacact 

ttacctctaa 

actaactgta 

attaccaact 

agcattcccc 

tccattgcag 

ggaactcaga 

tcatgcaata 

gaaggtatgt 

caccattctg 

aactgaacca 

taagacaac- 

aggcaccata 



actcaccaaa 
caaatggcat 
cagttcagct 
ttgtgaaaag 
gttcttttga 
t tea tc teat 
tggtggagac 
gttcaacacg 
taaaacattt 
ggcatggcac 
aactggaaga 
accaggcagc 
agggcttgac 
aagttcaggt 
cacccccagg 
cttttgtccg 
caagaaagtg 
ggaattggag 
ggacttttta 
atcataatgt 
aacagaataa 
tttcagtctc 
cagttttatg 
ccttgtgata 
attgetgata 
agtcttaaga 
ggtazgaata 
ccttcaagga 
gggagaaaaa 
ataccagaac 
agtctgaaac 
caggacaata 
atcagtatga 



caggcttctt 
aggggccccg 
gtttaatgat 
tcatcagaaa 
tggagatgea 
agatggagac 
tggagaaagc 
gtatcttgaa 
gcaccacaag 
tgcactgttt 
taagaaaaga 
tggtgatgct 
tgtacaacag 
tgcagacagg 
attacaggag 
gccctctggt 
cagatcacct 
aaaggcccca 
caagtcttta 
ttacaatgea 
ttcttzataa 
actaatgtaa 
tggggaacaa 
tetttgecat 
caaatggaga 
aagattatta 

gggggttgtc 

tgcatggtcc 
ctaattataa 
atggttaata 
caatggaaaa 
aaaacaaigg 
gecaacaata 



gcagtggaga 
aagctaaggg 
gggtccaagg 
cctccacagg 
gacagaattg 
aagatagcaa 
ttgaatattg 
gaagttatga 
gctcaagagt 
agtacagctg 
aaagctgeta 
atttctgaca 
tgggatgetc 
agagttatta 
gcaatcaatg 
acagaagatg 
tgcacatgaa 
accaggtttc 
caaaactgtc 
gcctactgga 
ataggtaagc 
aatactggga 
ttcatgeagg 
caccatcaca 
gggcagagaa 
gatgacttac 
catccctctt 
agecttcagt 
tacgtttccc 
aggaaaagag 
aaaaaatggg 
actttacatg 
ttnaaacttg 



tgaatacaga 

aaatggaaca 

gcaaactcaa 

gaatggaaat 

tttattacta 

cgttaattag 

gtgttgtaca 

aggtacctgt 

ttgacattgg 

ttgaaatgaa 

agatgettga 

tgctggtgat 

tctatacaga 

gcactaccra 

acctggtgaa 

tegtccngag 

gtgagcttgg 

tgaagataat 

aataataatg 

ttgtctctag 

cttacacttg 

cttaagtata 

ctactggaaa 

tgagcaagat 

gactttatac 

ccatatgact 

cca tact gag 

tatitcttcac 

agectctgat 

aaaaatcccc 

tagtgtatat 

tgtatatata 

attcaggeca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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cattcagaca tttgctctta tatacaaata 
tgttacatac aaaaaaggaa aaactataca 
tacatttaca tgtaagctaa atggaaccag 
cagaaaacct ttttccacca nctcttctat 
gtggttctcc agtttcatgt ccttattagg 
tgagtgtcct ccacatcggc ttgtgacttt 
gaacggcagg agctgcccca tactgcgcac 
ctcccaaccc aatgttctcc ctgagttgcc 
agtgaactgg tattaacaca acagacaatg 
actttggcaa ttggtcaaga aatgaataca 
atcaaaaccc ttggtgcttc tgagaccttt 
gtctaacatt gtagtaatag ttcccaacta 
tagctttgaa caggaacaga gtcaaacata 
aaaaagcccc caggtttc eg aaccctcact 
ctacagaaaa aaggaacaaa gcgctttata 
ttagttaatt ccc-tttatc taaatggcct 
gtgaataatt actgtcag ta^ctgcatcaca 
gctgtctcct tctgggttgt gctaaacaac 
ggaattgagt tagtgtgctg gctctgccat 
aggttcctca cttataaaat gggacagtct 
agaaggtatg tgaaaactct gtcaactaaa 
gctagtttga gaagccacct gaattacaca 
gaaagataag agagaacagg ttggttgaac 
taaagaatga aaataagcaa tgaactagac 
cagaattaca atctgtcata ttaacaaatg 
agaaactttc attgtttacc taataattta 
atgcaaaatc cagcttattt tcttctacgt 
aaatcctgaa aaaggaaaat attaggttgg 
gaaatttgcc gttttatatt ggagtacatt 
aaaaaaaaaa aaaactcgag 



tttaaattaa atacaatctg aaatgtgttc 
acgcagagca gtgtgtgtgt tttaaataat 
caatggtgcc caagttttta tcatcccttc 
ttcttgcctg gctttgctgg aacatggttt 
gaaggcattt gagtagagga taggactccc 
gctgttigaag acttgactga gcacattgaa 
ggtgcagatg gtgagcagca agtgccctgg 
ttcccctgga tttttctgca gaaaacaaaa 
tggtatgtta gaaaaattaa aaatatataa 
aatgacatta agtttctaac tcctgacctg 
tactgccatt tattagtttt acatggagca 
gaatgcgcag ataagcttag ttaacagaaa 
aaagttttat gttgtgcttt gtatttactc 
actgtaacca aggactaggt cacaaaatta 
catttcataa tatatcccct tttattataa 
aaatttgcca tgatggtagc agtgtccaaa 
gagaaaggaa gggatccctc aggagacact 
acagggagga aagctggacc cggagtcaaa 
acttacggca cccttgggca ggatatacaa 
aaaactacct tttagtagag aagtcaaatg 
tataaagact aataatttgg gtattaagag 
aacacagcta cagacatcat tccgtctaga 
ttgggcagaa tcacagacac aattccacac 
agaaggaaga aatcatgaag acttaggaag 
gagtttgcct tc-aagatca gatgttgctc 
atatcactag cttcctagtg ggtcaagcag 
gctctcaagc ttattgctta ttttaaagta 
tgcaaacgta attgcggttt Ctgcactgtt 
cttaaataaa tgtggttatg ttatacaaaa 



<210> 15 
<211> lt96 
<212> DNA 

<213> Homo sapiens 



<400> 15 

ccacgcgtcc ggtgaatctg gatgatctat 
tgccagtacc ccactgtctt gattattgtg 
taagctttgc aatcttttcc aagattgttt 
tattaagctt ctccattttt ctttaagaaa 
gtcgatttgg gaagaactga caataztgaa 
gcatttattc gggcctttta aaatttatct 
tcttgtatat atcttttgtt aaatgtattc 
tagtacttta atttcatttt taagtttatt 
taatactgac ttttatatirc tgtaatctta 
tatttttatt ttattgcaag tgccttaggg 
tgaataacca acaggcttac cttttgtttc 
ccaacctaaa tggttgaata gaagcggtgt 
cagcagagga tagcgtctag tttttcacta 
tttcatgctc ttatcagatt gaagatccct 
atgcatgttg aactccacca agtacttctt 
ccctttratt ttgttaatgt agattcttta 
aaccatactt agtcatgata tgttatcccc 
attttgtctg atttttgcat ctatgctcat 
tgtaagtctt gtcaggtgtc agggcttggt 
ttaagataaa aa:aaacttc aaaaaaaaaa 



ttatgtcart tatctatgtg tctcttctta 
gctttatggt gtcttgaaat taagtattgt 
tggctatagg ccctttcttt ataaagttta 
aaatctgctg gaatggcact gaatttttag 
cctrtcaatc gatggacatg gtatgtttct 
cagcaatact ttgtagcttt agtgaagaag 
ctaaatattt tgtggaaatg ttactgtaaa 
gctcgtacac agaaatatgt tttatttttg 
tttaaattca cttagttcag tagtagctct, 
acacctcgtg tacacaatcc atgtcatata 
ttcttgcttt gtcgtaatgg ctaggacctt 
gtgagttcat tactcttgtc ttattctcaa 
ttaaaatatg ttagctatag gtttttcccg 
tttattccta ttttgccaag agtgtcatga 
cctgcatctg ttgagagaat cttgacttt- 
atgttaaact gatcttgcat Ccctgatata 
tttatataat gctggattcg gtttgcaaat 
gagacagatt ggtccgtgat tttcctttct 
caatcttgac acatntacaa acatatattt 
aaaaaaaaaa aaaaaaaaaa aaaaaa 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3740 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1196 
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<210> 16 
<211> 2209 
<212> DMA 

<213> Homo sapiens 



<400> 16 

gagctgcgcc gggcttgggc gcctggggcc gccgctcccc accgtcgttt cccccaccga 60 

ggccgaggcg tcctggagtc atggccggcc tgaactgcgg ggtctctatc gcactgctag 120 

gggttctgct gctgggtgcg gcgcgcctgc cgcgcggggc agaagctttt gagattgctc 180 
tgccacgaga aagcaacatt acagttctca taaagctggg gaccccgacr ctgctggcaa 
aaccctgtta catcgtcatt tctaaaagac atacaaccat gttgtccatc aagtctggag 
aaagaatagt ctttaccttt agctgccaga gtcccgagaa tcactttgtc atagagatcc 
agaaaaatat tgactgtatg tcaggcccac gtccttttgg ggaggttcag cttcagccct 

cgacatcgct gttgcctacc ctcaacagaa ctttcatctg ggatgtcaaa gctcataaga 480 

gcatcggttt agagctgcag ttttccatcc ctcgcctgag gcagatcggt ccgggrgaga 540 

gctgcccaga cggagtcact ca^tccatca gcggccgaat cgatgccacc gtggtcagga 600 

ttggaacctt ctgcagcaat ggcactgtgt cccggatcaa gatgcawgaa ggagtgaaaa 660 

tggccttaca cctcccatgg ttccacccca gaaatgtctc cggc-tcagc attgcaaacc ?20 

gctcatctat aaaacgtctg tgcatcatcg agtctgtgtt tgagggcgaa ggctvcagcaa -* 8 0 

ccctgatgtc tgccaactac ccagaaggct tccctgagga tgagctcatg acgtggcagt 840 

ttgtcgttcc cgcacacctg cgggccagcg tctccttcct caacttcaac ctctccaact 900 

gtragaggaa ggaggagcgg gttgaatact acatcccggg ctccaccacc aaccccgagg 960 

tgttcaagcc ggaggacaag cagcctggga acatggcggg gaacttcaac ctctctctgc 1020 

aaggctgtga ccaagatgcc caaagtccag ggatcctccg gctgcagttc caagttttgg 1080 

tccaacatcc acaaaatgaa agcaataaaa tctacgtggt tgacttgagt aatgagcgag 1140 

ccatgtcact caccatcgag ccacggcccg tcaaacagag ccgcaagttt gtccctggct 1200 

gtttcgtgtg tctagaatct cggacctgca gtagcaacct caccctgaca cccggctcca 1260 

aacacaaaat ctccttcctt tgtgatgatc tgacacgtct gtggatgaat gtggaaaaam 1320 

ccacaagytg cacagaccac cggtactgcc aaaggaaatc ctactcacty caggtgccca 1380 

gtgacatcc^ ycamctgcct gtggagctgc atgacttctc ctggaagctg ctggtgccca 1440 

aggacaggct cagcctggtg ctggtgccag cccagaagct gcagcagcat acacacgaga 1500 

agccctgcaa caccagcttc agctacctcg tggccagtgc catacccagc caggacctgt 1560 

acttcggctc cttccgcccg ggaggctcta tcaagcagat ccaggtgaag cagaacatct 1620 

cggtgaccct tcgcacccct gcccccagct tccgacaaga ggccrccagg cagggtctga 1680 

cggtgtcctr. tataccttat ttcaaagagg aaggcgtttt cacggtgacc cctgacacaa 1740 

aaagcaaggt ctacc-gagg acccccaacc gggaccgggg cctgccatcc ctcacctctg 1800 

tgtcctggaa catcagcgtg cccagagacc aggtggcctg cctgactctc tttaaggagc I860 

ggagcggcgt ggtctgccag acagggcgcg cattcatgat catccaggag cagcggaccc 1920 

gggctgagga gatcttcagc ctggacgagg atgtgctccc caagccaagc ttccaccatc 1980 

acagcttctg ggtcaacatc tytaaytgma gccccacgag cggcaagcag ctagacctgc 2040 

tcttctcggt gacacttacc ccaaggactg tggacttgac tgtcatcctc atcgcagcgg 2100 

tgggaggtgg agtcttactg crgtctgccc tcgggctcat catttgccgt gtgaaaaaaa 2160 

aaaaaarama aacaaggggc cccgctgtgg gtatctacaa tggcaacat 2209 

<210> 17 
<211> 1774 
<212> DNA 

<213> Homo sapiens 
<400> 17 

taggcacccc aggcttttac actttatgtx ccggcccgta tgttgtgtga attgtaaccg 60 

gacaccaatt tcacacaagg aaccagctta tgaccatgat tacgccaagc tcgaaattaa 120 

ccctcactaa agggaacaaa agctggagct ccaccgcggt ggcggccgct ctagaactag 180 

tggatccccc gggctgcagg aattcggcac gagcggccgc gggcgcgagg tgaggggcgc 240 

gaggttccca gcaggatgcc ccggctctgc aggaagctga agtgagaggc ccggagaggg 30 0 

cccagcccgc ccggggcagg argaccaagg cccggctgtt ccggctgtgg ctggtgctgg 360 

ggtcggtgtt catgatfcctg ctgatcatcg tgtactggga cagcgcagqc gccgcgcact 420 



240 
300 
360 
420 
480 
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tctacttgca cacgtccttc tctaggccgc acacggggcc gccgctgccc acgcccgggc 480 
cggacaggga cagggagctc acggccgact ccgatgtcga cgagtttctg gacaagtttc 540 
tcagtgctgg cgtgaagcag agtgaccttc ccagaaagga gacggagcag ccgcctgcgc 600 
cggggagcat ggaggagaac gtgagaggct acgactggcc cccgcgcgac gcccggcgca 660 
gcccagacca gggccggcag caggcggagc ggaggagcgt gctgcggggc ttctgcgcca 720 
actccagcct ggccttcccc accaaggagc gcgcattcga cgacatrrccc aactcggagc 780 
tgagccacct gatcgtggac gaccggcacg gggccatcta ctgctacgtg cccaaggtgg 840 
cctgcaccaa ctggaagcgc gtgatgatcg tgctgagcgg aagcctgctg caccgcggtg 900 
cgccctaccg cgacccgccg cgcatcccgc gcgagcacgt gcacaacgcc agcgcgcacc 960 

tgaccttcaa caagttctgg cgccgctacg ggaagctctc ccgccacctc atgaaggtca 1020 

agctcaagaa gtacaccaag ttcctcttcg tgcgcgaccc cttcgtgcgc cugatctccg 1080 

ccttccgcag caagttcgag ctggagaacg aggagttcta ccgcaagttc gccgtgccca 1140 

tgctgcggct gtacgccaac cacaccagcc tgcccgcctc ggcgcgcgag gccntccgcg 1200 

ctggcctcaa ggtgtccttc gccaacttca tccagtacct gctggacccg cacacggaga 1260 

agctggcgcc cttcaacgag cactggcggc aggtgtaccg cctctgccac ccgtgccaga 1320 

tcgactacga cttcgtgggg aagrtggaga ctctggacga ggacgccgcg cagctgctgc 13 BO 

agctactcca ggtggaccgg cagctccgct tccccccgag ctaccggaac aggaccgcca 1440 

gcogctggga ggaggactgg tccgccaaga tccccctggc ccggaggcag cagctgtata 1500 

aactctacga ggccgacttt gttctcttcg gcraccccaa gcccgaaaac ctcctccgag 1560 

actgaaagct ttcgcgttgc cttttctcgc gtgcctggaa cctgacgcac gcgcactcca 1620 

gtttttttac gacctacgat tttgcaatct gggcttcttg ttcactccac tgcctctatc 1680 

cattgagtac tgtatcgata ttgtttttta agattaatat atttcaggta tttaatacga 1740 

aaaaaaaaaa aaaaaaacic gagggggggc ccgg 1774 



<210> 10 

<211> 1674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1649) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1663) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 
<222> (1665) 

<223> n equals a,t,g, or c 



<400> 18 

caagttggta cgcccgcagg taccggtccg gaattcccgg gtcgacccac gcgtccggtc 60 

gaagataggt tcgagacrggt ggatgttgca gctgaccatg cagttgggtc cggtgccgct 120 

cacacgctgc cccttttggg gctgcttcag ccagctcatg ctgtacgccg agagggctga 180 

ggcacgccgg aagcccgaca tcccagtgcc ttacctgtat ttcgacatgg gggcagccgt 240 

gctgtgcgct agtttcatgt cctttggcgt gaagcggcgc tggctcgcgc tgggggccgc 300 

actccaattg gccattagca cctacgccgc ctacatcggg ggccacgLcc actacgggga 360 

ctggctgaag gtccgtatgc actcgcgcac agctgccatc atcggcggct ttcttgtgtt 420 

ggccagcggt gctggggagc tgtaccgccg gaaacctcgc agccgctccc tgcagtccac 480 

cggccaggtg ttcctgggta tctacctcat ctgtgtggcc tactcactgc agcacagcaa S40 

ggaggaccgg ctggcgtatc tgaaccatct cccaggaggg gagccgatga tccagctgtt 600 

cttcgtgctg tatggcatcc tggccctggc ctt tctgtca ggctactacg tgaccctcgc 660 

ugcccagatc ctggctgtac tgccgccccc tgtcatgctg ctcattgatg gcaatgttgc 720 

ttactggcac aacacgcggc gtgttgagtt ctggaaccag atgaagctcc ttggagagag 780 
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tgtgggcatc ttcggaaccg ctgtcatcct ggccactgat ggctgagtt- tatggcaaga 940 

ggctgagatg ggcacaggga gccactgagg gtcaccctgc cttcctcctt gctggcccag 900 

ctgctgtcta tttatgcttt tcggtctgtt tgtttgatct ttzgcctttt taaaattgtt 960 

ttttgcagtt aagaggcagc tcatttgucc aaatttctgg gctcagcgct tgggagggca 1020 

ggagccccgg cactaatgct gtacaggttt ttttcctgtt aggagagctg aggccagctg 1080 

cccactgagt ctcccgtccc tgagaaggga gtatggcagg gctgggatgc ggctactgag 1140 

agtgggagag tgggagacag aggaaggaag atggagattg gaagtgagca aatgtgaaaa 1200 

attcctctt- gaacctggca gatgcageta ggctctgcag tgctgtttgg agactgtgag 1260 

agggegtgtg tgtgttgaca catgtggatc aggcccagga agggcacagg ggctgagcac 1320 

tacagaagtc acatgggttc tcagggtatg ccaggggcag aaacagtacc gactctctgt 1380 

cactcacctt gagagtagag cagaccctgt tctgctctgg gctgtgaagg ggtggagcag 1440 

gcagtggcca gctttgccct trctgctgtc tctgtttcta gctccatggt tggcctggtg 1500 

ggggtggagt tccctcccaa acaccagacc acacagtcct ccaaaaataa acattttata 1560 

tagainaaaaa aaaaaaaaaa aagggcggcc gctctagagg atccctcgaa gggcccaagc 1620 

ctacgcgcgc atgcgacgtc atagctctnt ccctatagaa gtngnaaagg gttc 1574 

<210> 19 i 
<211> 2018 

<212> DNA * 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2010J 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 
<222> (2012) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2014) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2016) 

<223> n equals a,t,g, or c 
<400> 19 



60 
120 
180 
240 



cagcaatigaa atcctgccct ccttccctca gaactactat attcagtggc taaatggctc 
cctgactcac ggtttgtgga atcttgcttc ccttttttcc aacctttgyt tattcgtatt 
gatgcccttt gcctttttct ttctggaatc agaaggcttr. gctggcctga aaaagggaat 
ccgagcccgc attttagaga ctntggtcat gcttcttctt cttgcgttac tcattcttgg 240 
gatagtgtgg gtagcttcag cactcattga caacgatgcc gcaaacatgg aatictttata 300 
tgatctctgg gagttctatc taccctattt atattcctgt atatcattga tgggatgttt 360 
grcacttctc ttgtgtacac cagtcggcct ttctcgtatg ttcacagtga tgggtcactt 
gctagtgaag ccaacaattc ttgaagacct ggatgaacaa atttatatca ttaccttaga 
ggaagaagca ctccagagac gactaaatgg gctgtcttca tcggtggaat acaacaLaat 
ggagctggaa caagaactitg aaaatgtaaa gactctcaac acaaaactag agaggcgaaa 
aaaggctcca gcatgggaaa gaaatttggt gtatcccgct gttatggttc tccttcttat 660 
tgagacatcc atctcggtcc tcttggcggc ttgtaatatt ctttgcctat tggttgatga 72 0 

aacagcaatg ccaaaaggaa caagggggcc tggaatagga aatgcctcCc ttt=ta=gtt 
tggttttgtg ggagctgcgc ttgaaatcat ttrga-tccc tatcttatgg tgtcctctgt 
tgtcggcctc tatagccttc gatttcttgg aaactttact cccaagaaag atgacacaac 
tatgacaaag accattggaa attgtgtgtc catcttggtt ttgagctctg ctccgcctgn 



420 
480 
540 
600 
660 



780 
840 
900 
960 
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gatgtcgaga acactgggaa tqactagatt tgatctactt ggcgactttg 
-tggctggga aatttctata ttgtattatc ctacaatttg ctttttgcta 
actgtgtctg gtccgaaaac tcacctctgc agttcgagaa gaacttttca 
gctzcataaa cttcacttac caaatacttc aagggattca gaaacagcca 
aaatgggcac cagaaagcac cgtgagacgc acagacggcg tcttctgcca 
cgagaactcc agattcacga cattcctgtc ccatgtagaa gcatttccat 
cccctctnca gaacctagac ctatcagtgc catttttttt tcataatcta 
ggctatggct gatctttfctt aaat~taact ttctgatgga ccctgtagtt 
tgcagattcc ttacagacat anagaacagc gcattcttct gtagacattt. 
gtaaatacaa tcacccatar gaaaaaattg tttccacctg atatgaaaat 
gcaaactccg ggacttc-aa agatttacct aaatcccatt atgtacttta 
agaagctgac ctgaaaggca tccttggtac taagtgaagc ttattcagaa 
tcaaatgcaa cggcaactgc ttgtagatat catttttgca gtgtatgttg 
ggttgcaatt atgtttctta tttccttaaa agcaaaaagc gtagtttctg 
tagaatgata ctgattagac tttgagccaa ggggaaaaca ctaaattctt 
agccttagag agccacagga atatcttctg ttgtacagtc taataagctg 
tatcatgtaa tcacagttta ^atgacagttt atgtatatat ataattcagt 
Sggggggccc ggtacccaat tcgcccaagn tnancnag 



gaaggtttaa 
ttgtgacaac 
aggccctagg 
agccttctgt 
ccaagagacc 
tcaaccgtgg 
cgaagaactt 
tccagttaag 
gctcacgttg 
gttagaaaag 
ttcagaatgt 
aatgcatttt 
gagctgtaat 
atttatgtta 
ttaaacctgg 
tggtaggaag 
attccctcga 



1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1900 
2018 



<210> 20 
<211> 2098 
<212> DNA 

<213> Homo sapiens 



<400> 20 

ggcacgaggg 

tattttctgc 

tctcaatgca 

ggtccagacc 

tcttccgggg 

tgctcctata 

gtcatggaaa 

cagaagagtg 

tggccaacct 

tacct.ctccc 

aaataatttt 

gtttgfcaacg 

gcaa&agctg 

C-Cgtgttgg 

ctcattccgc 

tgacatcaat 

tcatcactta 

gagccctata 

tgtatttttt 

tcacttttta 

attagccagt 

catttaagac 

agctgactga 

ttgtgtggaa 

cttgttggta 

tgr.tggaata 

ctgacaagca 

catgttttat 

gatctttttt 

attgctattt 

ttccatcttt 

tatatttaag 

caaatcattt 



accgagctaf. 
tgggaatcac 
ccttgctgaa 
gccggaaact 
aaaagctcct 
tacctaaatg 
acttcaggaa 
ttatcc taac 
gtatgatggc 
tactatgtga 
tgttaaagct 
tgcagtctgt 
ttctatatgc 
ttggtggttt 
caaaacaggg 
gtgataaagt 
ttgtttacta 
gcaagtgaag 
accatatcac 
ttcataactt 
tctataatat 
cctgctactg 
atgtcggrag 
atctttattt 
aaaattggtg 
ttgctctttg 
ttccctatcg 
ttttccaccc 
ctaacacata 
acttctgtgt 
catttaacat 
taccttt- tg 
gtggaatgaa 



tctcctggga 
actcctgcgc 
tgcgtccatc 
ttctcagtac 
cctctaccac 
tggaaaaaat 
gtatcaacac 
aaaactctac 

-gggggtgtg 

gaggatccaa 
caaatactgt 
tatgagttcc 
aaacatgatg 
tcataatctt 
ctcagttatt 
ctgtgttctg 
aagctacagc 
ggaccagact 
tgtaaagaag 
cagatttgta 
cttcctgcga 
tgtgaagaga 
gcgccccatc 
cacttcaatt 
gactggtttc 
tgaatttgtg 
ggattttaaa 
ataartccct 
tttgaactga 
ctgggggaaa 
caatatcaaa 
Caatgtagza 
tggtaaaact 



ctggctatga 
tcatacatgc 
acggaaacat 
ccctgcctcc 
acagaagaga 
tttgaagaat 
ttctcctgct 
agttccaacg 
gcaattgttg 
cggatcaata 
tttctttcac 
ctaatatatt 
tccttattat 
atttctgtac 
catttgccaa 
agttgtcaga 
caaaaatatt 
tcctaattaa 
aggggaaacc 
aaaccaattt 
tttatgtaga 
tgatacttac 
acaatccagg 
taaccattag 
aatgggtaaa 
cctaagtcaa 
gc tatgngca 
tttttgttgt 
ataacagact 
atacgaggga 
agaagaagac 
tcaaagcttt 
aatctgazga 



tggtgtgctc 

agagcgtgcg 

ttaattgctc 

aggtgtacgt 

caataaaaat 

ccatgtccct 

attctgaccc 

tgctgttcca 

ccatggcgaa 

gataaatgca 

tcttcaccaa 

cttatatgte 

tcaggagaai: 

tggaactagc 

gcttcgtgga 

tctcttgaag 

tttttttctt 

aggaagtcag 

cagccagcta 

ccaaaatata 

aaatgaacac 

aaggagtgtc 

aaagtctgtg 

atggtaaaat 

t.gtgttgtgg 

tgaatgtgta 

cagaatacta 

tatactttat 

taaagaaagc 

ttgattttaa 

aaacatctac 

ttagg-aatg 

aatggaaaat 



catcatgatg 
gaccgaagag 
cctcagctgt 
taacctgact 
caatcagaag 
ggtgaatgtt 
agaaggaaac 
ttcactcttc 
acctacacag 
aaaatggata 
agaaccttaa 
gagcaataat 
aaataactgt 
actttcttcc 
ggaacgtagg 
acaacatttt 
attctaaact 
gtacttttct 
ccttttttca 
agctgttt.tc 
accccttttc 
altacctgtg 
ttactgatat 
taagatgcta 
caaattaatg 
gtatctcrctt 
gtctcttcta 
acacagaata 
ctttgttcac 
ataa&aaaca 
ctt-ctcatc 
caaaac-tta 
caCictgcaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
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^««?^ C caCagttc ^ cttttcataa gcaaa-aaat ccctaggatg taatcaggac 2040 
ttcaaatgtg taattaaatt tttttaaaaa aaatctaaaa aaaaaaaaaa aaaaaaaa 2098 



<210> 21 

<211> 1746 

<212> DNA 

<213> Homo sapiens 



<400> 21 

gcgttcgcgc acctccagct cgggccgatg tggaagcttt ggagagctga agagggcgcg 60 

gcggcgctcg gcggcgcgct cctcctgctg cfccttcgcgc taggggtccg ccagctgctg 120 

aagcagaggc ggccgatggg cttccccccg gggccgccgg ggccgccatt tatcggcaac 180 

atctattccc tggcagcctc atccgagctt ccccatgtct acatgagaaa gcagagccag 240 

gcgtacggag aggcacagcc ccgacgggcc ccgggcaggg agggccgcca ggctggcccg 30O 

ggctggccag ggccttcctg gttggactta tggccgcccc tgggccgact agtegggacc 360 

tctccgtgtg ccggctgccc t^tgagggac acccgcttcc cgggtctgga agggagaagt 420 

cctcgacgcc gtgccccctt gcagggggag ccccgcccct gccggtgacc cactccgggc 480 
cgaggctccg aggcgatcca gtcctgattt tcccgctacc gctcgagctc ttgctcctgc 
gcctgcgccg tttggctcgc cagccgcgcc gccacttcag gtccagggtg gacgcatgcc 
cccaggcgcg ggcgtcttgc gagtcggcct cgcagctctg tggaagctgc acgcggcttg 
tcggaaaatc aaggcgttct gagttntaga tggttaatag caggttcttc ggtgtctgca 

gtcgacgaac gactggtgta ggcgtttgct gtgagaatgg agaatgcagg ggaacgcccc 780 

tgactgagaa gcgggccctg ggaaacgatt gtgaacgcgt gaatgaattg a-gactaaaa 840 

tccgctgcgg gggtcctaca gcgcagatgg taatgccgtt ctgactggct gggaacggca 900 
ccttagcaga tacttaaaag gcgccttctg tgtgccactg tcactgccaa cttggtgact 
catttaaaac tcataaccag ccggtgaggt cggtac-tcg ctcctcctca ttctgcggag 
gggaaagcag cacggaaatg ccctgtgact ggcagcggaa aaggcgacca ccgcttgtgt 
gtgggtgtcc cgacgtccgg agggggcagg agtttccacg ggtcctggga caoagctcac 
ctgttttgtt ttgaattaca cttatttata tgcaactaca ggcctgacgc tagcggtgaa 

gaaggcagat acagcctttt aaggagttgg cagatgagtg ggagagagaa aactaatctc 1260 

attatcggcc acaggctgtg gtcagtgttt -gaaggaaaa gtacagggat gtttggcaac 1320 

tgtggtattt caggtttgac cttaaatcct tacttaaacc agtttttaca aggattggtc 13 80 

taggtgcccg ggcgcggtgc tcacgcctat aatcccagca ctttgggagg ccgaggcggg 1440 

cggatcacga aatcaggaga tcgagaccgt cgtggctaac acggtgaaac cccatctcta 1500 

ccaaaagaat acaaaaaatt ggccgggcgt ggtggcgggc acctgtggtc ccagctattc 1560 

gggaggctgg ggcaggagag tggcgtgaac ccgggaggcg gagctttcag tgagccgaga 1620 

tcgcgccact gcactccagc ctgggcaaca gagccagact ccgtctcaaa aaaaaaaaaa 1680 

aaaaagggcg gccgctctag aggatccaag cttacgtacc cgtgcatgcg acgtcaatag 1740 



<210> 22 
<211> 2876 
<212> DNA 

<213> Homo sapiens 



540 
600 
660 
720 
780 



960 
1020 
1080 
1140 
1200 
1260 



1746 



<400> 22 

ccacgcgtcc ggcggcgagg cggcggtggt ggctgagtcc gtggtggcag aggcgaaggc 60 
Tr-r^l C l a gg ? gttggca ccggccccga gaggaggatg cgggcccgga tagggctgac 120 
gctgctgctg tgtgcggtgc tgctgagctt ggcctcggcg tcctcggatg aagaaggcag 
ccaggatgaa tccr.tagatt ccaagactac tttgacatca gatgagtcag taaaggacca 
tactactgca ggcagagtag ttgctggtca aatatttctt gattcagaag aatctgaatt 
agaatcctct attcaagaag aggaagacag cctcaagagc caagaggggg aaagtgtcac 
agaagatatc agctttctag agtctccaaa tccagaaaac aaggactatg aagagccaaa 
gaaagtacgg aaaccaggca gtctggacat tt-ccttgct ttttgattta tttaggggac 
aactgaaaat tttaagctaa tgaataaaga ggctgaagaa gactggcttc acteattatt 
acccacaaat aatatatgga gtgtagtttg gagagaatct ctagatttta atataccaaa 
gctattcact zaacagattt gacccaagtt acataagccg aattccatag atgagattgc 



180 
240 
300 
360 
420 
480 
540 
600 
660 
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atttatcttt tcatcaatat cctgacgttt ctttt-ggaa tcaagtatgt aaatacctct 720 

ctcactgaga aatttaaaaa aataatatgt cagggcagca tacaaaacag aaacteagga 780 

agaactgtct aacatgccac tatggttatt ttcaggatzg tggtaggatt attactacaa 840 

taaccttgtc gatcatccta ggtaattnga tttaggtgct tatccctcag ctgacgcttg 900 

tgv_gcacctc catcctattc taactttcac aatcttgaaa cctctacttt tcaatttaaa 960 

acttgacctt tttcacaata gtctrctaaa tattaatcct tttaaaaatt attaatgtat 1020 

gcattctacc atgtcttgtt gcccttctgt catttactat ttcaatgatc actgccttat 1080 

ctgtttatta ttccattact aaccactctc aggaoctgat ccaaattgga actttttaaa H40 

aaaatagaat ttttttttac tacaaatata cca etc teat gaatagaaaa ctatttttaa 1200 

aaaattaaaa gtacatgaat cccatcaact agaaataaca gctgctcaca tcatggagaa 1260 

tattctccca ggattttttr aggtgtatca gtagttttaa gaaaagtaaa ttgggtcaaa 1320 

atgtgtattc tatgttgcag ectgeatttt gcacctaaca gtttaccatg aatgtttttc 1380 

catgtcattt agctcatttt caatttgata gttaatggct ataaaaaatt ttatttgtaa 1440 

atatatggta ccataaacca aaacgtttat gttttgctgg gagatcattt tagatgtatc 1500 

ttcgtgccta cctgtgaaaa tttccttaga attcttaaag taaatttger gaattggatg 1560 

taaaagttta aaagacagta gatacacatt getaaattge tcttcagaaa agtgattccg 1620 

tttacttcct ctgaaaatac qtctttccct gtacctcacc caagtgtgta ttaccatctt 1680 

aaaattttta ctaaatataa tttaattca: tcttcattct ttaattaggg atrttctttt 1740 

ttttttttta gtttttatat ttttagagac atggtctcac tctgtcactc aggctggagt 1800 

gcagcggcac gatcatagtt cactgcagcc ttgaactcag gtgatctccc tetttggecf. I860 

cccaaagcgc tgggactaca ggcatgagac accacatcaa gcctgggaaa cttttaaaca 1920 

tgcataaaag tagagggtaa aatgacaccc cttaccaacc ctcactcagc ttcaataaac I960 

atcaacattc tge-gg-ett ttcatcttta ctaccctaca cacatttgcc ctttttttct 2040 

ttcctagaat attttaaggc aaactccgca Cgtgctttta catctctgtg tctctctcta 2100 

gtaagaatgt tcttttcagc aaaatcacag aattagecat tactccttta ttacctaata 2160 

cccagactgt ttaatttttc tagttgette aagtgtctgt ttacaattta tttgtttgaa 2220 

ttcatttcta agctactcag gaggctgagt caggagaatt gcttgaaccc aagaggcaga 2280 

ggttgcagtg agctgagatc gtgccactgt actccagcct gggcgacaaa gtgagactcc 2340 

atgtcaaaac aaaacaaaac aaaacaaaac aaaacaaaac aaacaacaca attgaataat 2400 

atccctcagt tattatttag agaagagatg actcaatata aatgtaatat gtgatcattt 2460 

ggtcacagaa tgaat-tcag zeattgetta ggcaagcctg gtaaaaaaaa aaaaaaaegg 2520 

gattccatga ttagtgtgtt gaaaaggggg ttgccaaagt gtaacagaca gagcegggtg 2580 

cagtggctca cgcctgtaat cccagaactt tgggaggeca aggcaggegg atcacgaggt 2640 

caagagattg agacaatccn ggccaacatg gtgaagcccc gtatcaacta aaaatacaaa 2700 

aacaattagc tgggcgtggt gacatgtgcc tgtagtccca gctactcagg aggctgaggc 2760 

aggagaattg cttgaacctg ggaggeggag gctgcagtga gctgagattg ctccactgca 2820 

ctccagcctg gecacagagt gagaticcat ctcaaaaaaa aaaaaaaaaa aaaaaa 2876 

<210> 23 

<211> 1052 

<212> DNA 

<213> Homo sapiens 

<400> 23 

tcgacccacg cgtccgccca cgcgtccgcc cacgcgtccg ggegggegea ggacgtgcac SO 

tatggctegg ggctcgcrgc geeggttget gcggctcctc gtgctggggc tctggctggc 120 

gttgetgege tccgtggccg gggagcaagc gccaggcacc gccccctgct cccgcggcag 1B0 

ctcctggagc gcggacctgg acaagtgeat ggactgeage acctcctgcc cccttccggc 240 

cgctttggcc catccttggg ggcgctctga gcctgacctt cgtgctgggg ctgcttzctg 300 

gctttttggt ctggagacga tgeegcagag agagaagttc accaccccca tagaggagac 360 

eggeggagag ggctgcccag ctetggeget gatccagega caatgtgccc cctgccagcc 420 

ggggctcgcc cactcatcat tcattcatcc attctagagc cagtctccgc ctcccagacg 480 

eggegggage caagctcctc caaccacaag gggggtgggg ggcggtgaac cacctctgag 540 

gcctgggccc agggttcagg ggaaccttcc aaggtgtctg gttgccctgc ctctggctcc 600 

agaacagaaa gggagectea cgctggctca cacaaaacag ctgacactga ctaaggaact 660 

gcagcazttg cacaggggag ggggtgccct ccttcctaga ggccctgggg gecaggctga 720 

cttggggggc agacttgaca ccaggcccca ctcactcaga tgtcctgaaa ttccaccacg 780 

ggggccaccc tggggggtta gggacctatt tttaacacta gggggctggc ccactaggag 840 
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ggctggccct aagatacaga cccccccaac tccccaaagc ggggaggaga tatttatttt 
ggggagagtt tggaggggag ggtggggggg gggaaaaaaa aataaaaaaa aaaattttta 
atttttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 



<210> 24 

<21I> 1541 

<2i2> DNA 

<213> Homo sapiens 



<400> 24 

ggcacgagct 

gctggatatg 

ctattzggtc 

aacttaatca 

agtcagctct 

ataattccaa 

ttaagtacan 

atgggaccaa 

atacaaggtc 

tattcaagga 

actctcttgc 

gaatgctcct 

tccgtggacc 

gggctgtact 

taagtatgaa 

ccaaacttgt 

tfcttttattc 

atgaattaat 

ctatgaaatc 

tcaaaaggaa 

ttttctaaaa 

tttgtaacct 

catttccttt 

atatcttggt 

gcttgttttg 

ataaatgtct 



ccctcgatcc 
tgggac tag t 
tcctaagtga 
cagccgggtg 
ggctgctggt 
ctttcccgga 
tgggacttgt 
cgctgggtgg 
tatgggctct 
gaasaaggtc 
ctaatgaaac 
ggcctraaac 
ccgtgccagt 
tactgtgata 
aggagtaatt 
ccttaaagat 
taagtacact 
ttgatatgcc 
agtttttaaa 
gaattcttgc 
attctgtttg 
acgttcactt 
aatgtctctg 
agatgtattt 
ttagggaagg 
attgtgtaac 



tactctgtct 
attcctgggt 
caaaaggcca 
ctacatgctc 
^ctgatatta 
aattctcagt 
atcaggtctt 
atttctgtat 
ga taagtgga 
taaatctcaa 
ctngtccgat 
atcaccgtag 
gtcttggaag 
ctgaaaagct 
aaaaacaagc 
gttgttatta 
gattctgtga 
ctaaatacat 
agatatactt 
ctgccataca 
tgtgaaaagt 
atgatctgga 
ttctttggct 
catccctaga 
gcttcctcca 
aaaaaaaaaa 



ctctttgttt 
atggcactgt 
cctctaagga 
ttagggcctg 
gttgtaagtg 
tgrgcacatg 
tttagtgcaa 
gagaaaattg 
ttagccatgg 
aacatcctca 
ggatcctgga 
gaagggtttt 
tgtcaacgtg 
gtcctgctga 
aaaacaaaac 
actcgagtta 
atgtaccttt 
aaaggtgctt 
tctctttgtc 
gaaactctct 
acaagaataa 
tttataaaca 
ctaccatctg 
gcaggtcagc 
acttcgtgtg 
aaaaanaaaa 



tggagaagtt 
cctatgccat 
aatggcttct 
tcccaatctt 
gcctctctgc 
aaaacgggtt 
tgtggtcaat 
gttttgaatg 
gcttgtttta 
gcacagagga 
ttgatacaag 
taaaatttta 
tttttggatg 
agcagctata 
aagacttagt 
gttcttattt 
tttattacac 
caaaatatgt 
ctgaggtttt 
agcactccct 
caatacttac 
ttacttggta 
ttttgttttt 
ctccttcccc 
aaattgtgat 



caatttacca 
ctcttcacca 
ggtgtttggc 
gcatattaaa 
tggaatgagt 
tgaagaggga 
tggtgctttt 
ggcagcagot 
tctactggag 
ggaacgaact 
gttgagaaat 
cgcgcaaaac 
atcctgtatt 
tttgaaatat 
ttttaaatga 
cctctgttca 
agggaaagaa 
agaaacatta 
tcggtcttgt 
gaccttaagc 
aacttccatt 
taacgttttt 
gtttttatct 
taatgcgaat 
gttgaagtga 



60 
120 
180 
240 
300 
360 
* 420 
' 480 
540 
600 
660 
720 
7 BO 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1541 



<210> 25 

<211> 2079 

<212> DNA 

<213> Hono sapiens 



<400> 25 

ggcacgaggg aggcgcggct gcgggacctg actagattct acgacaaggt actttctttg 
catgaggatt caacaacccc tgtggctaac cctctycttg catttactct catcaaacgc 120 
ctgcagtctg actggaggaa tgtggtacat agtctggagg ccagtgagaa cazccgagct 1B0 
ccgaaggatg gctatgagaa ggtggagcaa gaccttccag cctttgagga ccttgaggga 240 
gcagcaaggg ccctgatgcg gctgcaggac gtgtacatgc tcaatgtgaa aggcctggcc 300 
ZZt^lW ttcagaga ^ t cactggctct gccatcactg acctgtacag ccccaaacgg 360 
ctcttttctc tcacagggga tgactgcttc caagtzggca aggtggccta tgacatgggg 420 
gattattacc atgccattcc atggctggag gaggctgtca gtctcttccg aggatcttac 480 
ggagagtgga agacagagga tgaggcaagt ctagaagatg ccttggatca cttggccttt 540 
gcttatttcc gggcaggaaa tgtttcgtgt gccctcagcc tctctcggga gtttcttctc 600 
tacagcccag ataataagag gatggccagg aatgtcttga aatatgaaag gctcttcgca 
gagagcccca accacgtggt agctgaggct gtcatccaga ggcccaatat accccacctg 
cagaccagag acacctacga ggggctatgt cagaccctgg gttcccagcc cactctctac 



60 
120 



660 
720 
780 
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cagatcccta gcctctactg ttcctatgag accaattcca acgcctacct gctgctccag 840 

cccatccgga aggaggtcat ccacctggag ccctacattg ctctctacca tgacctcgtc 900 

agtgactcag aggctcagaa aattagagaa cttgcagaac catggctaca gaggtcagtg 960 

gtggcatcag gggagaagca gttacaagtg gagtaccgca tcagcaaaag tgcctggctg 1020 

aaggacactg ttgacctaaa actggtgacc ctcaaccacc gcattgctgc cctcacaggc 1080 

cttgatgtcc ggcctcccta tgcagagtat ctgcacgtgg tgaactatgg ca tcggagga 1140 

cactatgagc ctcactttga ccatgctacg tcaccaagca gcccccccta cagaatgaag 1200 

tcaggaaacc gagttgcaac a-ttatgatc tatctgagcc cggtggaagc tggaggagcc 1260 

acagccttca tctatgccaa cctcagcgtg cctgtggtta ggaatgcagc actgttttgg 1320 

tggaacctgc acaggagtgg tgaaggggac agtgacacac ttcatgctgg ctgtcctgtc 1380 

ctggtgggag ataagtgggt ggccaacaag rggatacatg agtatggaca ggaattccgc 1440 

agaccctgca gctccagccc tgaagacfcga actgttggca gagagaagct ggtggagtcc 15 00 

tgtggctttc cagagaagcc aggagccaaa agctggggta ggagaggaga aagcagagca 1560 

gcctcctgga agaaggcctt gtcagctttg tctgcgcctc gcaaatcaga ggcaagggag 1620 

aggttgctac caggggacac tgagaatgta catttgatct gccccagcca cggaagtcag 1680 

agtaggatgc acagtacaaa ggagggggga gtggaggcct gagagggaag tttctggagt 1740 

tcagatactc tctgttggga a^aggacatc tcaacagtct caggttcgat cagtgggtct 1800 

tttggcactt tgaaccttga ccacagggac caagaagtgg caatgaggac acctgcagga 1860 

ggggctagcc tgactcccag aacttuaaga ctttctcccc actgccttct gctgcagccc 1220 

aagcagggag cgtccccctc ccagaagcac atcccagatg agtggtacat tatataagga 1980 

ttttttttaa gttgaaaaca actttctttt ctttttgtat gatggttttt taacacagtc 2040 

attaaaaatg tttataaatc aaaaaaaaaa aaaaaaaaa 2079 



<210> 26 
<211> 1947 
<212> DNA 

<213> Homo sapiens 
<400> 26 

tgtaaacaga ttggagaacc tagcaataag attcaaagct aatctggagc ataaaggcac 60 

agttcagaga cagaataaca gggatcacaa gcatgaatta aaaggaattt atttgctfcca 120 

agttcctaga tacaaccctc ccatgctgca cctctccact gtcggagcac gttccgaaaa 180 

acagaatgcc ttgatccctg gtgggtgcga aggcagttgt tagggatggc aggcattggt 240 

gggctccaaa agatgaaggc cccacacaca ggtgtgctgc atttgggatc tgtgtgggtg 300 

tttcttggac cctttcttct gggagtaggg tacacactaa cgtttaatcc gctgtctggg 360 

tgcatgncca cagtacggtg gctaaactcg aacatcactg caaataggac gctgagcagg 420 

tccgtctgtc atgtcacgcc actgcacagg tccttgtccc cacacgacgg ggagcacttg 480 

cgtcagatgt tattgaacag ctcgtctcgg gcaggggaag cggggagttg gggatattaa 540 

ttgggggttt taattctatt atcatgtcag ctgacattat gactatataa tgtagttaga 600 

gacaattttt atcttgctta tagtaaaggt tcagcctgcc aattgtaaat cattctaatt 660 

tggcaggctt atttttgaca ttggaaaggg cagaaagcga tttgccccag tagtgtaata 720 

ggagttatag accagaggct gaaacccaaa ctatataaaa aggaattcag tggagggggc 780 

tttgtaatct ccattaattt gtgttgctac ttccaggatc accaaaaatr acatgcaact 840 

ttacatgtta aacacattga aacataacct atgtttataa agcataacgg gcttcccttc 900 

cagaagctct cctgcttgtc atgaagtgag aacaatgaaa agtcatagca gatactcagt 960 

ttaactctgt gtagaaccta gtagtgtttg agctgttatt cagarttgaa ttcagactgt 1020 

gtgttgtttg cttatggaca ctgcctgtcg ttctgtcact gttaaattaa tgagtctata 1080 

aggttrttcfc tccagaggcc ataggtgaca tcactaaaat tgcaagacaa antgtaatct 1140 

ttgctgctgc tgcactcccc aacctctccc ccaccccccg tggtgtgctg ctttctagat 1200 

gagcgtgttt tggagcaggc ccatctggga cactctatgc tttcaccaag gaagtgcgat 1260 

ctgagcagcc acaatccagc caaaagagga ttgtagatat ttgctctgat caactagatg 1320 

aaaatatagc agaatggatt cagcccactg ctctgtttta tccaactgag tctcngacca 1380 

gcaattggtg cataattatt acagcaaaag ttaagaaatg aaactgtagc aattatgtaa 1440 

atgaatgtgt tggcctctta atacctgtta ctagtggact tcctgtgagg aagttagttt 1500 

tttgttttga tgaaatgctt: tcgtttttta aar.cttaatt cf.gctgrcca catcctccca 2560 

aagtgtgctu acttcatttg tttaattraa atgaactttc ctccttgtat gtatgaggtg 1620 

acttgg-ggg tggggtgggt ggtttttgtt tttgtgtttt ttctttctta gggcatctgr 1680 

aggcctcaaa ggacctttcc tctaggtcac attcctcaga aagtcttcaa tcttcccttg 1740 
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tttttgtttg tctgttcttc ctaaagaata tttrcaaagc ttaaatttgt atattaatxt 1800 

aggactatct agaagtatag gctgtcgttg gcggcagcag tatattctga aacgtctcat i860 

agatatatat ^tttgaataa agatggcgtt gttgaacaaa aaaaaaaaaa aaaaaaaaac 1920 

tcgagggggg gcccggtacc caattcg 1947 



<210> 27 

<211> 3379 

<212> DNA 

<213> Homo sapiens 

<400> 27 

atgatattca agtcgaattc ccctcactaa agggaaaaaa gctggagctc caccgcggtg 60 

gcggccgctc taaacctact ggatcccccg ggatgcagga attcggcacg aggtctcggc 120 

tccctccgge acccgcccgg cccggctgct cccggctcct cccggccatg gggagctgcg 180 

cgcggctgct gctgcrctgg ggctgcacgg tggtggccgc aggactgagt ggagtagctg 240 

gagtgagttc ccgctgtgaa aaagcctgca accctcggaz gggaaatttg gctttggggc 300 

gaaaactctg ggcagacacc acctgcggtc agaatgctac cgaactgtac tgcttctaca 360 

gtgagaacac ggatctgact tgtcggcagc ccaaatgtga caagLgcaat gctgcccatc 420 

ctcacctggc tcacctgcca tctgccatgg cagactcatc cttccggtct cctcgcacat 480 

ggtggcagtc tgcggaggac gtgcacagag aaaagatcca gttagacctg gaagctgaat 540 

tctactccac tcacctaatt gtgatgttca agtcccccag gccggctgcc acggcgctgg 600 

accgctccca ggactttggg aaaacatgga agccttataa gtactttgcg actaactgct 660 

ccgcracatt tggcctggaa gatgatgttg tcaagaaagg cgctatttg- acttctaaat 720 

actccagtcc tcttccatgc actggaagga aggttatttt caaagctttg tcaccaccat 780 

acgatacaga gaacccttac agzgccaaag ttcaggagca gctgaagatc accaacctcc 840 

cgcgtgcagc tgctgaaacg acagtcttgt ccctgtcaga gaaatgacct gaacgaagag 900 

cctcaacatt ttacacacta tgccatctat gatttcattg tcaagggcag ctgcttctgc 960 

aatggccacg ctgatcaatg catacctgtt catggcttca gacctgtcaa ggccccagga 1020 

acattccaca tggtccatgg gaagtgtatg tgtcagcaca acacagcagg cagccactgc 1080 

cagcactgtg ccccgttata caatgaccgg ccatgggagg cagctgatgg caaaacgggg 1140 

gctcccaacg agtgcagaac ctgcaagtgt aatgggcatg ctgatacctg tcacttcgac 1200 

gttaa^gtgt gggaggcatc agggaatcgt agtggtggtg tctgtgatga ctgtcagcac 1260 

aacacagaag gacagtattg ccagaggtgc aagccaggct tctatcgtga cctgcggaga 1320 

cccttctcag ctccagatgc ttgcaaaccg tgttcctgcc atccagtagg atcagctgtc 1380 

ctrcctgcca actcagtgac cttctgcgac cccagcaatg gtgactgccc ttgcaagcct 1440 

ggggtggcag ggcgacgttg tgacaggtgc atggtgggat actggggctt cggagactac 1500 

ggctgtcgac catgtgactg tgcagggagc tgtgacccta tcaccggaga ccgcatcagc 1560 

agccacacag acatagactg gtatcatgaa gttcctgact Lccgtcccgt gcacaacaag 1620 

agcgaaccag cctgggagtg ggaggatgcg caggggtttt ctgcacttct acactcaggt 1680 

aaatgcgaat gtaaggaaca gacattagga aatgccaagg cattctgtgg aacgaaatat 1740 

tcatatgtgc taaaaataaa gattttatca gctcatgata aaggtactca tgttgaggtc 1800 

aatgtgaaga ttaaaaaggt cttaaaatct accaaactga agattttccg agggaaagcg i860 

aacattatat ccagaatcat ggacggacag aggatgcact tgtccaatcc tcaatcctgg 1920 

t:ttggaatac ctcgtagcag gacatgagga tacaagaaca ggcaaactaa ttgtgaatat 1980 

gaaaagcttt gtccagcact ggaaaccttc tcttggaaga aaagccacgg atattttaaa 2040 

aagagagtgc aagtagcatt aagatggata gcacataatg gcactrggct atgtccaaaa 2100 

cacaaacttt agagcaagaa gacctcagac aggaaactgg aattttttaa agtgccaaaa 2160 

catatagaaa tgtttgaatg catgggtctt tatctaattt atctcttctg gacccatgtt 2220 

taaatacagc ttcatttcat gaagagaaat gaaaacccct acactgatat ctgttctcta 2280 

tgggactgat tctgaaattc ttaactatta agaatatntt aatagcagca tgacatttag 2340 

cagtaatcca ttaagggcag tacctctaac aaggacgcct tccagcttca gctatgttac 2400 

ttacgtttga tgctacttaa agtaatgaat gacgztttaa ggaatcccta accctactat 2460 

cagaaaaggt gttcgttaaa gagccttctc ctgtgcgtta cgcatgaact ttgctctgta 2520 

ggtgttaaat ggaccctctc catgtgtata tagtatttcc ttgtataaag cactttacta 2580 

cctaccactt gtgttgtgaa cgtttggtga ccgctgttga aagaaggaaa agggegtgtg 2640 

agaaagccta ctgaagcagc agcactgcca ctacatgtgg acaaaagtga acatacaaaa 2700 

gaagctgtgc tatttaactc tgaatacttg gagaaactag gtgaagatgc aaccagaaag 27 60 

gagaatargt atgcgLgaag tctcagcttt gagctggagg ctagactcca agatgacagc 2820 
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catgatgaaa 
ataaatgtag 
ttgcaatgca 
gtgcttagta 
atagtcatta 
ttctcattac 
cttgaatata 
tttctetgaa 
tatatttcat 

gggggggccc 



ctttttaaaa 
acatatgctt 
cttgtataca 
cttaaacctg 
tatttacctg 
tattcatctt 
caaaacatga 
g-cctgtgga 
atatgtaata 
gtacccaaa 



aactaaacca 
ggctaaaggg 
cttcaaaaat 
aacat cgaaa 
tttttcaaca 
ctgtacccac 
acacagtgcc 
cttgcaaata 
aaggaatatg 



gaagagactt 
gaaatggact 
tattgcagac 
cagttttcct 
caatg^atgc 
gcataaccac 
atatgaataa 
tatatatata 
atctgaaaaa 



taaaataaga 
ttaaatttta 
acagaatttg 
ccttgtcttt 
gatagtcaaa 
tatacatagt 
-ttcacatac 
ttgctttgtt 
aaaaaaaaaa 



gaaagaaatc 
aagagctcat 
ttatattttt 
cttaacagta 
aaatcacagt 
ttcttttgta 
agaacctttt 
aatttgtttt 
aaaaactcga 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3379 



<210> 28 

<211> 2006 

<212> DNA 

<213> Homo sapiens 

<400> 28 

cccacgcgtc cgcggacgcg tgggattttc LcLcaagaaa aaaaaaggaa gacgtcttaa - 60 

taggaaaaga aaaacaagaa gcagtagcag ctaggtaatt taaacgtctt cctttttttt • 120 

tcttaagaga aaatggaaca tttaggttaa atgtctttaa attttaccac ttaacaacac 180 

tacacgccca taaaatatac ccagtcagta ctgtacttta aaatcccttg aaatgatgat 240 

atcagggtta aaattacttg tattgtttct gaagtttgct cctgaaaact actgtttgag 300 

cactgaaacg ttacaaatgc ctaataggca tttgagactg agcaaggcta ct^gttatct 360 

catgaaatgc ctgttgccga gttattttga atagaaatat tttaaagtat caaaagcaga 420 

tcttagttta agggagtttg gaaaaggaat tatatttctc tttttcctga ttctgtactc 480 

aacaagtctt gatggaatta aaatactctg cr.ttattctg gtgagcctgc tagctaatat 540 

aagtattgga caggtaataa tttgtcatct ttaatattag taaaatgaat taagatatta 600 

taggattaaa cacaatttta tacggttagt actttattgg ccgacctaaa Cttatagcgt 660 

gtggaaattg agaaaaatga agaaacagga cagatatatg atgaattaaa aatatatata 720 

ggtcaatttt ggtctgaaac ccctgaggtg tttttaacct gctacactaa tttgtacact 780 

aatttatttc tttagtctag aaatagtaaa ctgtttgcaa gtcactaata atcattagat 840 

aaattatttt cttggccata gccgataatt ttgtaatcag tacuaagtgc acacgtattt 900 

ttgccacttt ttcctcagat gattaaagta agtcaacagc ttattttagg aaactggaaa 960 

agtaataggg aaagagattt cactatttgc ttcatcagtg gtaggggggc ggtgactgca 1020 

actgtgttag cagaaattca cagagaatgg ggatttaagg ttagcagaga aacttggaaa 1080 

gttctgtgtt aggatcttgc tggcagaatt aactttttgc aaaagtttta tacacagata 1140 

tttgtattaa atttggagcc atagtcagaa gactcagatc ataattggct tatttttcta 1200 

tttccgtaac tattgtaatt tccactcttg UataaLtt: gatttaaaat ataaatttac 1260 

ttattcattc ttttaatagt caaaaatctt tgctgtzgta gtctgcaacc tctaaaatga 1320 

ttgtgttgct tttaggattg atcagaagaa acactccaaa aattgagatg aaatgctggt 1380 

gcagccagtt ataagcaata tagttaacaa gcaaaaaaag tgctgccacc ttttatgatg 1440 

attttctaaa tggagaaaca tttggctgca tccacataga cctttatgtt tcgttttcag 1500 

ttgaaaactt gcctcctttg ycaacattcg taaatgaagc agaatttttt cttc^ctttt 1560 

ttccaaatac gtcagttttg ttcttgtaag atgcatcatg ggtatCggtg ctgtgtaatg 1620 

aacaacgaat tttaatcagc atgtggttca gaatatacaa tgtcaggtt- ttaaaaagta 1.680 

tcttgatggt tcttttctat tcataacttc agactttcat aaagtgtacc aagaatttca 1740 

taaatttgtt ttcagtgaac tgctttttgc tatggtaggt cattaaacac agcacttact 1800 

cttaaaaatg aaaatttctg atcatctagg atattgacac atcccaattc gcagtgcctt I860 

tttgactgga tatattaacg ttcctctgaa tggcattgat agatggttca gaagagaaac 1920 

tcaatgaaat aaagagaata ttnattcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1980 

aaagggcggc cgctctagag gatccc 200G 



<210> 29 

<211> 3070 

<212> DNA 

<213> Homo sapiens 
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<400> 29 

gttggagcct tagaccttcg aagtgaccta tagaaggtac gcctgcaggt accggtccgg 60 

aattcccggg tcgacccacg cgtccgcaac gtatgaggag tctgattcct tcacatcctt. 120 

gtcaacattt gttgttgttg ttg-tgttgt tgtttttaat tttagctatc ctagtagatg ISO 

igaagtggta tctcgcgct^ tttatttgca tttctctaat gactagtgat gttgaacacc 240 

ctttcatgtg cttgttggcc attcgtatat cttcttggag aaatgtctat taagtccttt 300 

gcccaatttc aaattggctt g-ctztttgt tgagttataa ccattgaatt ttaattgccg 360 

taaaacactc catgagtaca tgtagcargc actcaataac tattagctaa tactatagtt 420 

gtaattattt ttgccacatg tggcctgaat tggaaggaaa ttgcctagtt tttccaagta 480 

ggagcggaga tgtaacaaaa cctgtattta aggccaataa agatttgact ccgtcagtgt 540 

tgatactttg cttttaattg atgctcccaa atcaagaaat tagaaacgtg tatacataag 600 

aatcagttta tctttggcag aaatcagtat attatcaatg tttaattaat tcctaagaat 660 

taatggcaat agtgcctcta gttttagctg agatttagat catccttctt cattctgccc 720 

tgttatgcga catttgctat aaatgccaca atgagaattt tttgaaacct gggtcagaga 780 

agcctacccc ttatatttaa caagtatgtg attgatgtat tatcattggc ttatgcttat 840 

tgcatcttgg ttttatgggg tgttcaaaag aagatagatg agaataggtg atccaaagta 900 

atttagaatt ttatcctcca atcgggactg aaccatcttg aactgatgaa tatattctac 960 

caggtttata ggtagatatg ccgtttcatt atatcggtga tttggagtga gacagtagct 1020 

tttoagcaga acaacagcca tatgtaayLc taaatttggt tattttcttt gaataagcaa .1080 

tggaaaccgc atgcattcag craactggtga ttcagatatc ttctgaaggc cctctctccc * 1140 

ctgtgctccc acagcttctg tcctgaactc tcctgtctcc ttgcgatgcc tctgtgccta 1200 

tacctatttr gtgaac-ctg cct.gcaatgc cgtcccaagt cagttggtga ctcctcaaac 1260 

ctcactgaag tctttcgggg cggattcaaa tgctgctttt agccattaaa atctgrtaca 1320 

tctgttacat actttcacaa tagcacctat cctagtgtag ttttaaaaat gcacccaaaa 1380 

atttttaaag cagtgcacgc atgcagtaaa caaaattgga cagtacagaa acaaatggtg 1440 

aagacagtct cccagttttc ttccccatgg atagcacttt ctctttttta tccttccaaa 1500 

gatattttgt gctgatacaa gaantt.ttat accgtacgaa gtgcctattc atcttatctt 1560 

ttacttcagg gtaatttatg agtaatgaaa aaattcaagc attacaaaaa tgtagagtgt 1620 

gtgaagcaaa aggtcccctt tactttgatc cctaacacta gccctttcca aacaacactg 1680 

tcctaatcaa cactcccctc tccagaagta acctctgtta taagccttgt ctgtgccttc 1740 

agattgttct gcttacactt atatgcctat ttaaaagtgt atatttttgt attttgtttg 1800 

tttgtttgtt tgagacagag tttcgctctt gttgcccagg ctgaagtgca atggcactat i860 

ctctgctcac tacaacctgt gtctccctgg ttcaagtgat tctcctggct cagcctcctg 1920 

agtagctagg attacaggtg cacgccacca tgcccggcta atttttgtat ttttagcaga 1980 

gatggagctt caeca tgttg gecaggctgg tcacgaactc ctgacctcag gttatccacc 2040 

caccttggcc tttcaaactg cegggattae aaggcatgag ccactgtgcc eggecaagaa 2100 

tttt:tttatc gataccatag tc/agctctct gcctcttcgg aacgatgtcc actttgetta 2160 

tgatcaaccc aagcaggact cttctctccc tggacgcctc tcccctggtc tggaatcttc 2220 

cagtzctgcc agaattggee tttcccagat getgeaaact tccagttgaa cccctctttc 2280 

tgcgcggccc ctggggctgc gagaccaaaa tccatgagtt ctgtgtaccc tagacctttg 2340 

gaaggtgaga gcagggccct gagaaaaggc aaaacgcctc ctctccctga tgatacccct 2400 

acccgcccta ctcctcacca gaattgtcag tggectttea ccacagtggt ccttcctgcc 2460 

tgagccctgc actgtcccag accacacaga agtctggtca cctctgggcg cctgggatgg 2520 

tcaccgaaga gaagcacijct gtccccgtct tctcctggct tetgecagaa aatcgaacaa 2580 

g^gcaattaa cacactgtta ctgccgaagc ctgaaactcc caggacttgt ccttgatcct 2640 

tccagaaacc accaggtccg gcacttggag ccccccggac agggacctcc cagccgagcc 2700 

ctcaaagaac tccatgaaat caggaactgc ttgatgaaat gtatctcctt gtacctggaa 2760 

gatgaagece aaacacccac acctctgtct cccccagggc tcgggatgtc tccagcagcc 2820 

cggccacgca gcttcccagg tgggctcggg gaggtgggag cagggaccat ctctgtcccc 2880 

tccaccctca ctccatccac ctcggagacc accctccccc agecagatae ggaataaaac 2940 

tacagacgea gaaaaaaaaa aaaaaaaaaa gggcggccgc tctagaggat cccbcgaggg 3000 

gccccagctt aegegtgeat gegaegtcat agctczcccc ctatagtgag tegtattata 3060 

agggagcaaa 307Q 



<2l6> 30 
<21L> 2227 
<212> DNA 

<213> Homo sapiens 
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ttagc-tatt ttcaatttga tagttaatgg ctataaaaaa ttttatttgt aaatatatgg 1440 

taccataaac caaaacgctt atgttttgct gggogatcot tttagatgta tctttgtgcc 1500 

tatctgtgaa aatttcctta gaattcttaa agtaaatttg ctgaattgga tgtaaaagtt 1560 

taaaagacag tagatacaca ttgttaaatt gttcttcaga aaagtgattc tgcttacttc 1620 

ctctgaaaat acctctctcc ctgtacctca cccaagtgtg tattaccatt ttaaaatttt 1680 

tactaaatat aatttaattc attcttcatt ttttaattag ggattttttt ttrttttttt 1740 

tagtctttat acttttagag acatggcctc actctgtcac tcaggctgga gtgcagtggc 1800 

acgaLcatag ttcactgcag ccttgaactc aggtgatctc cctctttggc ctcccaaagt 1860 

gctgggatta caggcatgag acactacatc aagcctggga aatxtttaaa cnngcataaa 1920 

aagtagaggg taaantgcca ccccttncca accctcactc agcttcaata aacatcaaca 1980 

ttctgctggt cttttcatct fctactaccct acacacattt gccctttttt tctttcctag 2040 

aatattttaa ggcaaattcc gcatgtgctt ttacatctct gtgtctctct ctagtaagaa 2100 

tgctcttttc agcaaaatca cagaattagc cattactcct ttattaccta atacccagac 2160 

tgtttaattt ttctagttgc ttcaagtgtc tgtttacaat ttatttgttt gaattcattt 2220 

ctaagctact caggaggctg agtcaggaga attgcttgaa ccctagaggc agaggttgca 2280 

gtgagctgag atcgtgccac tgtactccag cctgggtgac aaagtgagac tccatgtcnt 2340 

aacaaaacaa aacaaaacaa aSacatgacga aacaaacaac acaaztgaat aatatccctc 2400 

agttattatt tagagaagag atgacccaat ataaatgtaa tatgtgatcc atttggtcac 2460 

agaatgaatt tcagtcattg cttaggtaag cctggtaaaa aaaaaaaaaa cgggattcca 2520 

tgattagtgt gttgaaaaag gggttrtgccc aaagtgtaac agacagagcc gggtgcagtg 2580 

gctcacgcct gtaatcccag aactttggga ggccaaggca ggcggatcac gaggtcaaga 2640 

gatcgagaca atcctggcca acatggtgaa gccctgtatc aactaaaaat acaaaaacaa 2700 

tcagccgggc gtggtgacar gtgcctgtag tcccagctac tcaggaggct gaggcaggag 2760 

aattgcttga acctgggagg cggaggctgc agtgagctga gattgctcca ctgcactcca 2820 

gcctggccac agagCgagat tccatctcaa aaaaaaaaaa aaaaaaaaaa a 2871 



<210> 112 
<211> 1037 
<212> DKA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (936) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (946) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 
<222> (951) 

<223> n equals a,t,g, or c 
<400> 112 

ccggtccgga atucccgggt cgacccacgc gtccgggcgg gcgcaggacg tgcactatgg 60 
ctcggggctc gctgcgccgg ttgctgcggc tcctcgtgct ggggctctgg ctggcgttgc 120 
tgcgctccgt ggccggggag caagcgccag gcaccgcccc ctgctcccgc ggcagctcct 180 
ggagcgcgga cccggacaag tgcacggact gcagcacctc ctgccccctt ccggctgctt 240 
tcgcccatcc ttgggggcgc tctgagcctg accrtcgtgc tggggctgct ttctggcttt 300 
Ctggtctgga gacgatgccg caggagagag aagtt caeca occccatago ggagacegge 360 
ggagagggct gcccagctg- ggegctgate cagtgacaat gtgccccctg ccagccgggg 420 
ctcgcccact catcac:ca: -ca:ccattc tagagccagL uLcLgccLcc cagacgcggc 480 
gggagccaag ctcctccaac cacaaggggg gtggggggcg gtgaatcacc tetgaggect 540 
gggcccaggg ttcaggggaa ccttccaagg tgtctggttg ccctgcctct ggctccagaa 600 
cagaaaggga gcctcacgct ggctcacaca aaacagcr.ga cactgactaa ggaactgcag 660 
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<220> 

<221> SITE 
<222> (289) 

<223> n equals a,t,g, or c 



<400> 30 

cggacgcgtg ggncgaccca cgcgtccggg aaaaarggaa aaratgccgt gtaaaatctc 60 

gttctgtgtc tgaattgccg caggctcaga tct-catttg aggttctgtg tctgaattgc 120 

cgtaggctca gatcttcatt tgaggttatg ttctataagt taacgttgat cttgtgtgag 180 

cttccggtag ctggagtaac acaggcggcc tcacagcgac ctctccagcg ccttccaagg 240 

cacatctgca gccagcgtaa tcctcctggg agatgcctcc tcaaggccnt gctccagacc 300 

acgtggggar ggcctgacar ccaattccca ggctgtcccc acccttgrag agtgacccta 360 

aacgctagac agatggggaa tgggaaagaa aagaaagctg cagacctcaa gttaaaattc 420 

cctcaaaaac gtttLLactc atctgctttt tctgaaagga taaaggctrt ctgaaaatta 480 

ttttctaaca aataacatga acacttctag aaaccctaga aaaacacaaa gtattcaaaa 540 

tagaaagaaa aattacccat taj;tctttaa gccagcatta tccattgcgg cgcttttgga 600 

gttgggtgag gccgtagcct ctigccaagtc aaggagcccg gtggtggctg tggcattcct 660 

gcagggttgt ttttttttct ttgagatgga gtctcactct tgtcacccca gctggaatgt 720 

ggtggtgtaa acagctcact gcagccttga ccctgaggct caagcgatcc ttctgccttg 780 

gcctcctgag tagctgggat cccaggcgag agtcaccaca ccctgtccat gttcctgcag 840 

gtcttgatat gcgaggacgc tgtgtcttcc ctgccacatt ttctrcttct ttcttgagac 900 

agacccttgc tccatcaccc aggccagagt gtggtsgtgc gaacacggct cactgcagcc 960 

tcgaccctca ggctcaagcg atcctcacgc ctcggacccc caaagtgctg ggatcacagg 1020 

cgagagtcac catgctggcc tgaatcttca gggtatttca cggttgaagt gtcacttact 10B0 

tarccatscc cgtttcaaga gtgtaggtgg tcaccctgtc tctgycgctg acctggcctg 1140 

gaccctcggc tgtgagaggg aggggtgggc tgggctggag gaacctraag ccctcgtgat 1200 

gtcacaagcc catccggctg ggcatcccct gctgtgtcct gagctgcaca tgccccaggt 1260 

ggcccccaca gcagaggcga gccactgrag ggtgragggc ttccacggac ggtcttcagg 1320 

ggragaagaa gggcccaggc ccccaggaga ctcaggagac cagagcctgg ggtcaggggc 1380 

tyagcagggg ctyarccagg gctggatgtc cggagccagc cccgmagccc tgJcgktcttt 1440 

gttcttcgca ctcccaccgt ccgtgtgaac agctccagcc ccacctgcgc ctccctgtgc 1500 

tgggctccat cagggagccc agaagacgtg tgtgcttctg aaattgggtc cctacatgcc 1560 

tttgtcccag tgcaccttgc tccttccatt tacfcatcgag atttaaatgc ctgttttctc 1620 

cccagaggtt gacggatata ttcagacgtt acgacacgga tcaggacggc cggattcagg 1680 

tgtcgtacga acagtacctg tccatggtct tcagtatcgt atgaccctgg cctctcgtga 1740 

agagcagcac aacatggaaa gagccaaaat gtcacagttc ctatctgtga gggaatggag 1800 

cacaggtgca gctiagatgct gttcttcctt tagattttgt cacgtgggga cccagc-gta 1B60 

catatgtgga taagctgatt aatggttttg caactgcaat agtagcrgta tcg-tc-aat 1920 

gcagacattg gatttggtga ccgtctcatc gcgccacgag gtaaangtaa tgtctcaggc 1980 

attctgcttg caaaaaaatc tatcatgtgc ttttctagat gtctc-ggyt ctatagcgca 2040 

aatgctttta ttagccaata ggaattttaa aataacatgg aacttacaca aaaggctttt 2100 

catgtgcctt acttttttaa aaaggagttt attgtattca ttggaatatg tgacgtaagc 2160 

aataaaggga atgttagacg Cgtaaaaaaa aaaaaaoaaa aaaaaaaaaa aaaaaaaoaa 2220 

aaaaaaa ???7 



<210> 31 
<211> 1288 
<212> DNA 

<213> Homo sapiens 



<400> 31 

cccccgggct gcaggaattc ggcacgagcc tgacctcccc agcctcatct ctccctcctc 60 

tgctcccgcc cnctgtgctc cagccaacgt ggcctgtcac tcgtccacct gccatactgt 120 

cctgactcca ggcctttgcc tgtgctatgg cctctqctgq gaccactictt gtctctcccc 180 

tgctgtgtct gctaattccc actcgtgtca gtgacccatg gctgcagaac acaccactcc 240 

atccatggaa aacaatcaca atcactgatt actatctctc cctgggcttc ctggggtgga 300 

ctgggcccag ctgggtggtt cactttgggg cctcagcdgt catyyycaga cagtggctgg 360 



WO 00/06698 



PCT/US99/17130 



19 



ggtcactgca aagactrccc tgcatctctg gcagttgatg ctgactgoca tctgagacac 420 

ctacccaggg cctctccctg gggcctgggc tcctgcctag cttggttggg aggctccaag 480 

accaacatcc caagaaagat gagacagaag ccagatcacc tttttgggcc tggcttcaga 540 

agtcacccag catcacttct gctgcattta tttcttaaaa cacaaatatc aaaccccatc 600 

tcttgatggg agggggcctc atggcttgta aacatgttct aaactccact ctgcccggcc 660 

ttggctcaac gtgtctggta atgtgtgggc tgtgaggctc cccgaacgta aacctcagac 720 

tgcaacgctg gccgttacag gctctggcac acgggcccac gtcaggccca tcgccacagt 780 

gatggttgtt ctgtgactgt ttctggtggc ctctgctcca cactccaggc tgacgctgtg 840 

ccccttccac tgggaccctc gggtggcttc catgcacttg tgccctaaat cctgctccta 900 

gactaaactt catctcctgt gttctcattc tgcagcatgg ctgttaggga acctgaccat 960 

ctgcagcgcg tctcgctgcc aaggtataat gtcagtgcct cccttcagtg gctcccatgt 1020 

cacagaattg tcctgcagcc ctggcacatg tgtgccatgt gggagctggg gcaggtcctc 1080 

tttcatcctg tggctccgag ggagggggcc gctccttccc cagcctctac cctgacttgg 1140 

ccctcgtcct gcagccactc agagagcacg atggagctgg agcttcagtt ttgaccaaat 1200 

gcgtgtcgcc ggcttttgrg tgtgtgtgty Lgtgacagag ccagaccctg tccctaaaaa 1260 

aaaaaaaaaa aaacccgagg gggggccc 1288 



<210> 32 
<211> 3280 
<212> DNA 

<213> Homo sapiens 
<400> 32 



gcgtccggtg agaggcaagg acttttcatc tttagggcat tctgaccacg tcctgcttca 60 
gagagattgt tcccggcgtc tcagtgctat ggggagcagg ttcctcctgg tcctgctctc 120 
aggtctcact gtcttactgg ctctgccagg atcagaagcc aagaattctg gagcttcetg 
tcctccatgc cctaaatatg ccagctgcca caacagcacc cactgtactt gtgaagatgg 
crttcgggcc aggtctggca ggacatactt tcatgattcc tctgagaagt gtgaagatat 
taatgaatgt gaaaccgggc tggcaaagtg caagtataaa gcatattgta ggaataaagt 
tggaggctac atctgtagct gtttggtaaa atatacttta ttcaactttc tggctggtat azu 
tatagattat gatcatccgg attgttatga gaacaatagt caagggacga cacagtcaaa 480 
cgtggatatt tgggtgagtg gggtgaagcc tggatttggg aaacagctgg tacgtataac 540 
tatgccatcc ccctacccaa acattaacat gtcttcctgt gatttttagg gtagggtagt 
tctatccagg ggtaattttg tcctctgtcc caaggtcatc tgtcaatgac cggggacact 
tttggttgcc ataccttggg ggtgatgtgt gtgactggca tctggtggat agagaccagg 
gatacagctx aacatcctac agtgcccagg acagcctccc acaatcaaga agr.gcctagt 



180 
240 
300 
360 
420 



600 
660 
720 
780 



B^ayv.un.n. tt^attLUddga agr.gcctagt 780 
gccatatgtc catagagata gagaaataca agtgtagggg gaaagtgcct cagctggca- 840 

rraflan^rr-r j...^^ . 



900 
960 
1020 
1080 
1140 
1200 



- - "a'-a'-^yauy y dcicxy Lyn. u cagctggca- 

-caaagacct acatcaagca cctagattct caatgccaca cgcaccttgt agcaccrtaa 
taaatatcgt tgccttctgg gctccccact ccaacacttg tgcatattcc ccanttctta 
catttcagta agagtcaatc taattagctt aatttttttg gaaggaaaac ctgagaagaa 
atggaagcag agaggacttt gcaagaaggg ctacccaact aattcaaagc gtggagttga 
gcatctggaa tgcgagtttt gcttctccag gaaagggtca aatttctgaa tttgatatag 
tctatgaaac caagaggtgc aatgagacaa gggagaatgc ttttctggaa gctggaaata izoo 
acaccatgga tatcaactgt gctgatgctt taaaaggaaa cctaagagag agcactgcag 1260 
ttgccctatc acttatcaat ctcttgggga tattctgaat gcatcctttt ttagtaaacg 1320 
aaaagggatg caggaagtaa aactgaactc ttacgttgtg agcggcacca tcggtttgaa 1380 
ggaaaaaatt tccctctctg aacctgtgtt cctgactttt cgccazaatc agcctggtga 1440 
caagagaaca aaacatatct. gtgtctactg ggagggatca gagggaggcc gctggtccac 1500 
ggagggctgc tctcatgtgc acagcaacgg ttcttacacc aaatgcaagc gcttccatct 1560 
gcccagcctt gccgtcctcg tggctcttgc ccccaaggag gaccctgtgc cgaccgtgat 1620 
cacccaggtg gggctgacca tctctctgct gtgcctcttc ctggccatcc tcaccttcct 
cctgtgccgg cccatccaga acaccagcac ctccctccat ctagagctct ccctctgcct 
cttcctggcc cacctcctgt tcctgacggg catcaacaga actgagcctg aggtgctgto 
ctccatcatt gcagggctgc tgcacttcct ctacctggct tgcttcacct ggatgctcct 
ggaagggctg cacctcttcc tcaccgtcag gaacctcaag gtqqccaact acaccagcac 
gggcagattc asgaagaggt tcatgtaccc tgtaggctac gggatcccag c-gtgattat xsao 
tgctgtgtca gcaatagttg gaccccagaa ttatggaaca tttactcact cttggctcaa 2040 
gcttgacaaa ggattcatct ggagcttcat ggggccagta ycagtcatta tcttgataaa 2100 



1680 
1740 
1800 
1860 
1920 
1980 
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cccggtgttc tacttccaag ttctgtggat tttgagaagc aaactctcct ccctcaataa 2160 

agaagtttcc acca~tcagg acaccagagc catgacattt aaagccattt ctcagctatt 2220 

tatcctgggc tgttcttggg gccttggttt ttttatggtt gaagaagtag ggaagacgat 2280 

tggatcaatc antgcatact cattcaccot catcaacacc cttcagggag tgttgctctz 2340 

tgtggtacac tgtctcctta accgccaggt tcgaacggaa tacaaaaagt ggtttagtgg 2400 

gatgcggaaa ggggtagaaa ctgaaagcac tgaga^gtct cgctctacta cccaaaccaa 2460 

aacggaagaa gtggggaagt cctcagaaat cttccataaa ggaggcactg catcatcatc 2520 

Cgcagagtca accaagcaac cgcagccaca ggttcatccc gtcictgctg cttggctaaa 2580 

gatgaacrga cctggcaagt gccatggcaa cgacccggaa gttacccctc ctttccgttt 2640 

gtctacagcg cccctgcggt cacacacaga ttggacaaat gccactattc ctagctttcc 2700 

tgtgaaaagt ctaggctcat ncacctattt tggcttttta tgttcataga aagaacaaga 2760 

catttgggag aattcttaga tccagagtcc agtagtgtgg cacgtgcaat gaagtgtcgg 2820 

aaggatgcat tttaaagatg gcgggcggga gaagtggatt ttcttcttgc agctactgcc 28B0 

accttgccag aaactcacta actggcatct ggattcagct catagttccc tttctggcct 2940 

ctctgccgca ctttatgccc ccaaagacct tacactaaca ctccacactc acataattca 3000 

acaattttca tatggatcag Cattaaagag ggtgttgcat tttgcaatac aaaaatgcat 3060 

-atcaggtgc tggagaggat grggagaaat aggaacactt ttacactgct ggtgggactg 3120 

taaactagtt caaccattgt ggaagtcagt gtggcgattc ctcagggatc taggactagg 3180 

aataccattt gacacagcta tcccattact gggtatatac cccaagggct ataaatcaty .3240 

ctgctataaa gacacatgca cacgtaaaaa aaaaaaaaaa ,3280 



<210> 33 

<211> 1297 

<212> DNA 

<213> Homo sapiens 



<400> 33 

ccacgcgtcc ggactgcctt acggacattg gatgaagccg aagcatttag aatggtgcct 60 

ggcacacagt tggtgcgtga catggttaag cttcgtgccc ccacccacat ctcatcttga 120 

atgtgacggt ttccccggct ccctcctgcc gccatgtgaa gaaggtcgtt gcttccccnt 180 

caccttccac caccatgatt. gccatggatg ctccccactc caaagcagcc ctggacagca 240 

ttaacgagct gcccgagaac atcctgctgg agctgttcac gcacgtgccc gcccgccagc 300 

tgctgctgaa ctgccgcctg gtctgcagcc tctggcggga cctcatcgac ctcatgaccc 360 

tctggaaacg caagtgcctg ccagagggct tcatcaccaa ggactgggac cagcccgtgg 420 

ccgactggaa aatcttctat ttcctacgga gcctgcatag gaacctcctg cgcaacccgt 480 

gtgctgaaga gganatgttt gcatggcaaa ttgatttcaa tggtggggac cgctggaagg 540 

tggagagcct ccctggagcc cacgggacag attttcctga ccccaaagtc aagaagtatt 600 

"ttgtcacatc ctacgaaatg tgcctcaagt cccagctggt ggaccttgta gccgagggcL 660 

actgggagga gctactagac acactccggc cggacatcgt ggttaaggac tggtttgctg 720 

ccagagccga ctgtggctgc acctaccaac tcaaagtgca gctggcctcg gctgactact 780 

tcgtgttggc ctccttcgag cccccacctg tgaccatcca acagtggaac aatgccacat 840 

ggacagaggt ctcctacacc ttctcagact acccccgggg tgtccgctac atcctcttcc 900 

agcatggggg cagggacacc cagtactggg caggctggta tgggccccga gucaccaaca 960 

gcagcattgt cgtcagcccc aagatgacca ggaaccaggc ctcctccgag gctcagcctg 1020 

ggcagaagca tggacaggag gaggctgccc aatcgcccta ccgagctgtt gtccagattt 1080 

tctgacagct gtccatcctg tgtctgggtc agccagaggt tcctccaggc aggagctgag 1140 

catggggcgg gcagtgaggt ccctgtacca gcgactcctg ccccggttca accctaccag 1200 

cttgtggcaa cttactgcca catagctctg acgttttgtt gtaataaatg ttttcaggcc 1260 

gggcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 1297 



<210> 34 
<211> 2184 
<212> DNA 

<213> Homo sapiens 
<400> 34 

ggcacgayyc gcccgaggat gtgcrgctgg ccgctgctcc tgccgrgggg gctgctcccc 60 
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gggacggcgg cggggggctc gggccgaacc tatccgcacc ggaccctccr ggactcggag 120 

ggcaagtact ggctgggctg gagccagcgg ggcagccaga tcgccttccg cctccaggtg 180 

cgcaccgcag gcnacgtggg cttcggcttc: tcgcccaccg gggccatggc gtccgccgac 240 

atcgtcgtgg gcggggtggc ccacgggcgg ccctacctcc aggattattt tacaaatgca 300 

aatagagagt tgaaaaaaga tgctcagcaa gattaccatc tagaatatgc catggaaaat 360 

agcacacaca caataattga atttaccaga gagctgcata catgtgacat aaatgacaag 420 

agtataacgg atagcactgt gagagtgatc tgggcctacc accatgaaga tgcaggagaa 480 

gctggtccca agtaccatga ctccaatagg ggcaccaaga gtttgcggtt atzgaatcct S40 

gagaaaacta gtgtgctatc tacagcctta ccatactttg atctggtaaa tcaggacgtc 600 

cccatcccaa acaaagatac aacatatmgg tgccaaatgt ttaagactcc tgcgttccaa 660 

gaaaagcatc atgtaataaa ggttgagcca gtgatacaga gaggccatga gagtctggtg 720 

caccacatcc tgctctatca gtgcagcaac aactttaacg acagcgttct ggagtccggc 780 

cacgagtgct atcaccccaa catgcccgat gcattcctca cctgtgaaac tgtgatcttt 840 

gcctgggcta ttggtggaga gggcttttct tatccacctc atgttggatt atcccttggc 900 

actccattag atccgcatla Lgcgctccta gaagtccatc acgataatcc cacttatgag 960 

gaaggctcaa tagataatiic tggacrgagg ttattttaca caatggatat aaggaaatat 1020 

gatgctgggg tgattgaggc tggcctctgg grgagcctct tccataccat ccc tccaggg 1080 

atgcctgagt tccagcctga gggtcactgc actttggagt gcctggaaga gctctggaag 1140 

ccgaaaagcc aagtggaatt catgtgtttg ctgttcttct ccatgctcac ctggctggca 1200 

gagcatcagg ctgcgtcatt ttcgaaaagg gaaggaaatg aaattacttg cctatgatga 1260 

tgatnc^gac ttcaatttcc aggagtttca gtatctaaag gaagaacaaa caatcttacc 1320 

aggagataac ctaattactg agtgtcgcca caacacgaaa gatagagct.g agacgacttg 1380 

gggaggacta agcaccagga gtgaaatgtg tctctcatac cttctttatt acccaagaat 1440 

taatcttact cgatgtgcaa gtattccaga cattatggaa caacttcagt tcattggggt 1500 

taaggagatc tacagaccag tcacgacctg gcctttcatt atcaaaagtc ccaagcaata 1560 

taaaaacctt tctttcatgg atgctatgaa taagtttaaa tggactaaaa aggaaggtct 1620 

ccccttcaac aagctggtcc tcagcctgcc agtgaatgtg agatgttcca agacagacaa 1680 

tgctgagtgg tcgattccaa ggaatgacag cattacctcc agatatagaa agaccctata 1740 

aagccagaac ccttfcggtgt gtggcacgtc ttcttccctc ttcccctgcc acagagattt 1800 

ctcccatcca acttgcttgt ttgccttctg c^aactcagc tgcacgctga gcaccaagag I860 

cttgtgatca aaattctgtt ggacttgaca atgttttcta tgatctgaac ctgtcatttg 1920 

aagtacaggt taaagactgt gtccactttg ggcatgaaga gtgtggagac ttttcttccc 1980 

cattttccct ccctcctttt tcctttccat gttacatgag agacatcaat caggttctct 2040 

tccctttctt agaaatatct gatgttatat atacatggtc aataaaataa aactggcctg 2100 

acttaagata accattttaa aaaattgggc tgtcatgtgg gaataaaaga attctttctt 2160 

ccctaaaaaa aaaaaaaaaa aaaa 2184 



<210> 35 
<211> 949 
<212> DMA 

<213> Homo sapiens 
<400> 35 

ggcacgagct cccattgacc tcaccgctca gagtgcagta ttagggcaag gtttcgccac 60 

tgcctccctc catgaatgta tttctccctc ctgccctggg gacacgggga gtggcccgtt 120 

tctttcccca tctagtccca gaaagatggc gctcggtttt ctgttgttgg attttttttt 180 

tttttttttt tittttgcacc aaagtggcaa ctaggrcagt gttgggggat caagctggcc 240 

tcggggtggg gggcccccac ctgcctctcc ctggttccca cagtgttagc gtccctgaaa 300 

agacaatatt ccctctaaag caataagggg tgacgggccg gggggagtgt ttgctgctgc 3 60 

cggcccccag ctccccctcc cfcgccaggtg tgggggagac tcctgttgtg actgaatgta 420 

acccccccac ccctgccgca gccaatgcag gggaaggggg acactcttcc tgtctcttct 480 

ccccagctaa agagactttg gacttagggg gcccatgagc ctggagaggc cttaaccctg 540 

tgaggaagta tagggggagc cctctcccac ccccatcccc ttctgagagt ggtcaatgtt 600 

tacaagcccc tgagcccccc tgcccaggga ctcagacccc gttgctgtcc ctccccggcc 660 

ccggtcttcc tgggccctcg ctgctcctct gccctttctg gggttggggt gggtgcaggg 720 

gtcaccgtgt tccctgtctg ccctgtaccc acagtctccc cgccccctct ccaccctgtg 780 

tgacttccct ctcttttacc tgctcctgta aatactccct tctcccaata aaactcggtg 840 

tgtgttctcc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 900 
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aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 949 

<210> 36 

<211> 3338 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2861) 

<223> n equals a, t,g, or c 
<220> 

<221> SITE 
<222> (332B) 

<223> n equals a, t,g, or- c 
<220> 

<221> SITE * 
<222> (33301 

<223> n equals a,t,g, or c 
<400> 36 

taccgggaca ttctggagtc cgagaagtca acggcgcggt tgctgcggcc gccgcgctcc 60 

ccggcccgag gcgatggaga acggagcggt gtacagcccc actacggagg aggacccggg 120 

ccccgccaga ggcccccgga gcggcctcgc tgcctactt- ttcatgggcc ggctcccatt 180 

gctccggcgc gtcctcaagg gcttgcagct gttgctgtc^ ctgctggccT: tcatctgtga 240 

agaagttgta ccacaatgta ctttatgtgg aggactttat ttttttgagt trgtaagctg 300 

cagtgccttt cttctgagtc tccttatact gattgtgtat tgcactccat tttatgagag 360 

agttgatacc acaaaagtaa aatcatcgga tttttatatt actttgggaa caggatgtgt 420 

gtttttgttg gcatccatca tttttgtttc cacacatgac aggacttcag ctgagattgc 480 

tgcaattgtg tttggattta tagcaagttt: tatgttccta cttgactcta tcactatgct S40 

gtatgaaaaa cgacaggagt cccagctgag aaaacctgaa aataccacta gggctgaagc 600 

cc tcactgag ccacttaatg cctaaagact ctggggagca gatgttacct aaggtagtga 660 

ccctgcattg tggtgcccga gccctggcag aagctcttgt aaaatctgtt aattgtttaa 720 

accacttcct: trggagagca aggggaaggt caagaaggca gttttatcaa tattgtgtca 780 

gtcaccacaa agtaggccag ataagttaaa aaaaattttt ctttaaataa taattgaaac 840 

ttatctcaaa tggagatttt ggtgggagga ggagaaaaca attgttttta aatcacacag 900 

ctcaacggtt gataaatgat tcegtcattc tgttacaggt cattctttta ctaggcttag 960 

cttccaaatt atgctttata gc!:gtataaa catcgtgatt atattcatct acttagaaat 1020 

tgttttattt ttaaattaat ttgcttagct gtttgttttg atgcttagat tatgttctgt 1080 

taatgggaat ttaacatatt taagaaacca atacttaaaa tgttggtcta ggtttttttc 1140 

cccaacatat: attaccaggc tttactgtat ttcactcagc cttaaatgtt ataatatttt 1200 

tggataacgg ttattaactc tttgagacct tcgtatagcc tataaaatgf. atgggagatg 1260 

ttggtatttt atgt.gtataa aagcaacaat accagcaact tcgtgtttat accgcacctt 1320 

ggttgctgat gtcaagtaaa aaaaagattg ttttgtaaca cataaaaaaa tggaagaaac 1380 

tgataccaca uctaaggacc aaagataaga aagacttttt gcccaagaca gtgaaagtaa 1440 

ttacaaaaac aagctttgac cacttaccaa gtatczgaag agatgagttc atactatgat 1500 

ttagaaagtg gztcaattcc cctgttggca tatgattatt: tttactaaaa ttaatacagc 1560 

tctgtgggtc ttccttagtg ttittctttga agccaatctg ttttttttag gacaccagcc 1620 

tttggttttt catctgttcg agatgccLcL Lccctgcctc cttatcagac agaaatggag 1680 

tcatgtgctg ctgcttcatc tagcagaggt tggcctctgg ctctgacact ttttgtcag- 1740 

tgtctt.agg tggtcctgaa tcttgggccc ttttgattgt gaatactgtg -agcaggatc 1800 

ttgagagtcc ttgttctrac ataggcattg ctctagtttg tctttggcaa aaaaaaaaaa i860 

aaaaaaaagt aaatatccag ggaaccctgc ccagaetaat actgttggtg gca-aagaga 1920 

atcaagccat tctcaagaga taacttcata accagaattg tctgntggcc agcagc-gtc 1980 

acagataggc agggcacttg ggatatgacc tttctgtcca ggtgactcac agactagacc 2040 

tttcttatcc tcctcctaga gtctcgacct gggactctag tgttaagatg atgagcccgr. 2100 
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2160 
2220 
2280 
2340 
2400 



gcatcaggtc cttctgcact ttggtggaag tctcccaggg taggtttcct atttgaaaca 
gtggaatcat gtttccagtg ataaagttta atgacctcat cctttttttt ttttctcatc 
tgccatttgt gtgtcttaga tgggttttaa ttgcatgaat gzggctaatg cggttctcag 
aaattggtca gtatggcccr acatagcttc tgctctgtct tactgactca atacctttag 
gatttgtatc agagtttgga tactagtgtt agtggcggtg tcaccactac ttaattggga 
gataatgaaa ccaa-car.gg atgctgtttt tattgggcat gtcatctaag agaggagaaa 2460 
tagctgggtt ttgggtctaa ttatgaataa ggactgattc agaaaacgag tttatggtag 2520 
gtagactaaa gcttcacatc agactgtaec attgtgattt agacctatct aaaattcaga 2580 
gcatatcacc tgggctacct cagggtcacc acccatgtat tgggcttagt caggattgac 2640 
agatacattc tcagctggcc tgtcatataa aacatactgt cattgagctt aagctccgct 2700 
tgttctgagg tttcacctcc atgtgtttca ttggtgcaaa agtggatctc ttagttggtc 2760 
acttaattct ttctttttca gaaagatagt atgttcacty gtatatttgg tcactcttag 2820 
aaccttcctt cacattgttc tttatgggac ccatgaatgg nttagcct~t cttttctatt 2880 
gtagaaggaa ataaatagga gtaaaaagac cattgtagta aataagttca aggggaactt 2940 
gggaccagaa accactgtta tgtacaaaaa aatggcaaat tcaataaact caaatttaaa 3000 
ataattttta aattaacagt tatgataaat tttatatttt atacaaatag attgcttaga 
atggttctca agaattataa g^gaaatgaa ctcacagtac aaaaatttta taattactat 
acttgtgttt tgtttggggg ctgggaaatg tatttttaca ttgtagccaa tcattttata 
tttgtcaatt taaatcttat gggtcttttt ttttttatct ctcttgatgt cagattttat 5240 
agtcttttta aataaaccca tttaat.taaa accttaaaaa aaaaaaaaaa aaaaaaaaaa j3 00 
gggcggccgc tcgcgaatct agaactangn cccacgcg 3338 



3060 
3120 
31R0 



<210> 37 
<211> 1563 
<212> DNA 

<213> Homo sapiens 
<400> 37 

cggcacgagg agaaaaggac tcagaccttc cagattggct ccagaaggca gctgctgtca 60 

gtgattttcc ttacagacca ttgcagccac ctaccttgcg ccatggtccc gagacagcat 120 

ggcgtcttcc cccacacgga cctgcaggtg gcctcttagg agctttctcg ctgaaggagg 180 

tcatccctct tagtcaca-c tgagcagcac tcacttaggc ctcggccccg agggtcatgc 240 

gggggcacct ggccgggttc ccggctctgt ctggtctggc atctgtctgc ctttgggcta 300 

ccttttcggc acagctccct gggcctgtgg ctgctacaag ctggactcca gctcccctgg 36 0 

gctgcagtgc tgctcgcagt ggtccagaaa agaggctggg cacagcggct cctggctcag 420 

cagcatcctt ggcgcaggct ggcccaggag ctccgtgcag ggtgctgcca gttgacccag 480 

ccccggctgc actcaatgtc cgagaaccgg gctggctggg gggcctcttr gacggagctt 540 

tactccaagt cttgctgaat ttcctgagga aaagcactga tgttctcatg gacacaaggg 600 

aggcagaatc tcttgaagta gaataaactg cactcctttc cagtcttcct ttctcagctt 660 

ctcctggacc gccagctcct ccatgcgcct cagacactcc caacgccgca ctgcgggggt 720 

tcctcccgcc ctggcccatc tcatccccct tggctgttga tacagctccc ctgtgtgcat 780 

gtcgccctgt ggcagatgct cagggacttc tcagactcca ggatcactcc gagtacactg 840 

accacacagc ctgctgcaca aactgcagcg ccagctaagg accaggaatc tgacattgtc 900 

ggaggtgaag ggatcctttg tgacatagct ttcctccagg aggaccatcc actgggtgtc 960 

gggggagctt ctgccccaag tagcaggagg gagctgtcaa ggcgtggagt Lcacactcaa 1020 

acactcccag aggacggcac tctgcatggg acgccatcca gctcctttga ctgtggaatc 1080 

aagtacatta tcagctggcc cctggctcct ggctgtgacc tcccttcgct tgaactgagc 1140 

cttgtgtgta agggcgtatc aagttgtatg ggcttcgccg ctgggtgacc ccaaatgcaa 1200 

ctccaggaag catgggttac caatgactaa ggttaacgat cagctgggcg cggcgctgac 1260 

gcctgtaatc ccagcacttt gggaggtcga ggcgaatgga tcacgaggtc aggagttcaa 1320 

gaccagcctg gccaacatgg tgaaaccccc atctctacta aaaatacaaa aattagccgg ' 380 

gcgtggtggc gggcgcctgt agtcccagct actcaagagg ctgaggcagg agaatcgcg- I440 

gaacccggga ggcgcagctt gcagtgagct gagatcr.cgc ccgctgcact ccagcctggg 1500 

cggcagagcg agactccgtc ccaaaaaaaa aaaaaaaaaa aaaaaaaact cgaggggggg i 560 

° CC 1563 



<210> 38 
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<211> 1048 

<212> DNA 

<213> Homo sapiens 



<400> 38 

cccgggccga cccacgcgtc cgacaaaaaa caaggratgt gtgtgccttg ggatgctttt 60 

ctgggcttac cccatctgtg tcttcataga ttctcrgtcc rgccagccat gtctttggtc 120 

cacaggagcc acttcccact tcaaccctcc cacaacctcc ccactgttta ctctcttcat 180 

gccccgtgcu cntgccccaa atcccttcac ccaafctggga aaactggatg atagataagt 240 

agtttcttat ttacatgcat ttctctggaa atzaaagttg ctttccagct agtatttcct 300 

cacgtcggaa gcatgtgtaa aataacacat ggaaaccatt tccacctggc acacaagaaa 360 

aaaagggtct gcctatattg taactgactt ctacactttg catgtttcag tctgacccac 420 

aat^tttttt tttggatagt tgaagtaaaa aacgtggtta caagatgact aaagtttggc 480 

ctacaatccc ctgtttggac aacaggaaga attgtgcagt gttttgcaac aqqactaatt S40 

ctagattccc actgccctta aagataacta gagggaaagg gtcttctctt tctttttcat 600 

ttattaasga tacttacagg ttcttcagaa gaagtgatgg gtcct^tagg toatgaaata 660 

ggtgttgatg gtgttatggg j£gatgatgta actggtggcg gctgtataaa gtcttcatcc 720 

ttaaacatca gaactgagag ttigggttact gatttaaaag aaaatgtgta ttaatctaca 780 

gcccactacc ttgactacca ggaacttgtc acaactacct catatcctgg cacaggattc * 840 

aaaaggcaaa atctggaacc taattacgtg f.agccaaata ttaagcttcc tcttagacca ' 900 

aacagtacaa taaatattcc ttattaagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaagg gcggccgc 1048 



<210> 39 

<211> 1430 

<212> DNA 

<213> Homo sapiens 



<400> 39 

cggcacgagg agcagctgag tcccttccct gtctttcact cttccggcat cggtggtttt 60 

acttctccga ttgaaccctg cttcctcgac ccccctggga ggccgccttc ttcaggcgcc 120 

tcccctctct ccacgagctc gctctgacag ctgaggaact ggcaagatcc tgctacccag 180 

agggtgaatg ggtatctttc ccggaataat cc taattt.ttr. cfcaagggtga agtttgcaac 240 

ggcggccgtg attgcaagcg gacaccagaa aagtaccact gtaagtcatg agatgtctgg 300 

tctgaattgg aaaccctttg tatatggcgg ccttgcctcc atcgtggctg agtttgggac 360 

tttccctgtg gaccttacca aaacacgact tcaggtrcaa ggccaaagca ttgatgcccg 420 

tttcaaagag ataaaataza gagggatgtt ccatgcgctg tttcgcatct gtaaagagga 480 

aggtgtattg gctctctatt caggaattgc tcctgcgttg ctaagacaag catcatatgg 540 

caccattaaa attgggattt accaaagctt gaagcgctta ttcgtagaac gtttagaaga 600 

tgaaactctt ttaattaata tgatctgtgg ggtagtgtca ggagtgatat cttccactat 660 

agccaatccc accgatgttc taaagatccg aatgcaggct caaggaagct tgttccaagg 720 

gagcatgatt ggaagcttta tcgatatata ccaacaagaa ggcaccaggg gtctgtggag 780 

ggtaagtact cttttcctgc tattatccta cactctcagt tcctacaatt tgcagagaat 840 

tttttttr.at ataaagacat aaaatcgtga attataatcc aaaaactaag gtaagoaact 900 

cctcatctcc cttgaaaggc ccaaaactta tcattggcct tttatttctg cataatgttt 960 

ggggautata taggtgggga aagtcattac attatttgag atggctgttt cgatcatatt 1020 

cacagtgaat gtagttgttc agtgtatttt tttgcaagCt ctgtactaac acgatgatgt 1080 

acgtctttgt agtgcttatg ttcaaaagct gttaccggct gtgtgctgtg gctcatgcct 1140 

gragtcccag cactttggga ggccaacgcg ggtggatcac ttgaggtcag aagttcaaga 1200 

ccagcctggc caaccLggcg aaaccccatc tcaaccagaa atacaaaaat tagccaggca 1260 

tggtggtgca tggctgtggt cacagctact caggaggctg aggcgggaga attgtttgag 1320 

ccctggaggt ggaggttgca gtgagccaag azcatgccac tgcactorag cctgggtgac 1380 

ggggcgagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aaaactcgag 1430 



<210> 40 
<211> 2103 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (2101) 

<223> n equals a.t,g, or c 
<220> 

<221> SITE 
<222> (2102) 

<223> n equals a,t,g, or c 



<400> 40 

ttcctcgtag cgagcctagt ggcgggtgtt tgcattgaaa cgtgagcgcg acccgacctt 60 

aaagagtggg gagcaaaggg aggacagagc cctttaaaac gaggcgggtg gtgcctgccc 120 

ctttaagggc ggggcgtccg gacgactgta tctgagcccc agactgcccc gagtttctgt 180 

cgcaggctgc gaggaaaggc 1~cctaggctg ggtctgggtg cttggcggcg gcggcctcct 240 

ccccgctcgt cctccccggg cccagaggca cctcggcccc agtcatgcta agcagagtat . 300 

ggaagcaccc gactacgaat gctatccgtg cgagaacagc tattccacga gaggatccgc . 360 

gagtgtatta tatcaacact tctgtttgca acactgtaca tcctctgcca catcttcctg 420 

acccgcttca agaagcrctgc tgagttcacc acagggtgtc ctgggccggg cctmtgagac 480 

agtggtgatg ttgatgctcc tcac~ctgct ggngctaggt atggtgtggg tggcatcagc 540 

cattgtggac aagaacaagg ccaacagaga gtcactccac gacctttggg agtactatct 600 

cccccacccc tactcatgca tctccttcct tggggttctg ctgctcctgg ctgctggaag 660 

acctggagga gcagctgtac tgctcagcct -tgaggaggc agccctgacc cgcaggatct 720 

gtaatcctac ttcctgetgg ctgcctttag acatggagct gctacacaga caggtcctgg 780 

ctctgcagac acagagggtc ctgctgggta tgtggcttcg tagggcttgg gatacctggg 840 

tttccccaag gagagtagcc cctggttcca ggtgcttgct gacagcctcc catccctgca 900 

cagagaagag gcggaaggct tcagcctgkc aacggaacct gggctacccc ctggctatgc 960 

tgtgcttgct ggtgctgacg ggcctgtctg tgctcattgt ggccatccac atcctggagc 1020 

tgctcatcga tgaggctgcc atgccccgag gcatgcaggg tacctcctta ggccaggtct 1080 

ccttctccaa gctgggctcc tttggtgccg tcattcaggt tgtactcatc ttttacctaa 1140 

tggtgtcctc agttgtgggc ttctatagct ctccactctt ccggagcctg cggcccagat 1200 

ggcacgacac tgccatgacg cagataattg ggaactgtgt ctgtcccctg gtcctaagct 1260 

cagcactccc tgtcttctct cgaaccctgg ggctcactcg ctttgacctg ctgggtgact 1320 

ttggacgctt caactggctg ggcaatttct acattgtgtt cctcracaac gcagcctttg 1380 

caggcctcac cacactctgt ctggtgaaga ccttcactgc agctgtgcgg gcagagctga 1440 

tccgggcctt tgggccggac agactgccgc tgcccgrctc cggtttcccc caggcatcta 1500 

ggaagaccca gcaccagtga cctccagctg ggggtgggaa ggaaaaaact ggacactgcc 1560 

atctgctgcc taggcctgga gggaagccca aggctacttg gacctcagga cctggaatct 1620 

gagagggtgg gtggcagagg ggagcagagc catctgcact attgcataat ctgagccaga 1680 

gtttgggacc aggacctcct gcttttccat acttaactgt ggcctcagca tggggtaggg 1740 

ctgggtgact gggtctagcc cctgatccca aatccgttca cacatcaatc tgcctcactg 1800 

ctgttctggg ccatccccat agccatgttt acatgatttg atgtgcaata gggtggggta 1860 

ggggcaggga aaggactggg ccagggcagg ctcgggagat agattgtctc ccrtgcctct 1920 

ggcccagcag agcctaagca ctgtgctatc ctggaggggc tttggaccac ctgaaagacc 1980 

aaggggatag ggaggaggag gcttcagcca tcagcaataa agttgatccc agggtttgct 2040 

ttgttittttt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2100 

21 03 



<210> 41 
<211> 2349 
<212> DNA 

<213> Homo sapiens 
<400> 41 

tcgaccuacg cgcccgccca cttctgtgtc tgtatgtatg ggatgtgtct gattgtgtgt 



WO 00/06698 



PCI7US99/17I30 



26 



ctgtggtgta tttctgtgtc tgt-atgtatg gggtgtgtgt gattgtgtgt gtgtctggtg 120 

tatctctgta tgtatggggt gtgtctgtgt gtgattgtgt gtgtgtctgg tgtatctctg 180 

tgtctgtatg tatggggtgt gtctgtgtgt gattgtgtgt ctgtgtttat gtgtgtgtgt 240 

ctttgtgtga ttttttgtgt atatggcaaa cccagaacag aacattatca ctcacctcac 300 

ctggcaaaac aaaaagctt" cagagagatg tgtggccgtc atgatgrctc agctgcaggc 360 

atttttcaaa gttatgtttg atgcagctgg cacctttctt ctcggggaca aatgtgatct 420 

ttgatcattg agttgagaat gagtttactc aataatttta aaatgcctcg gtgtgagagg 480 

atgggctgct agtaaacatt tgtggcaaat atgcgttgcc ttttcctcct tcaagtgcat 540 

ggggcaggct gcctctctgc acgtccatct gtigggtcagt ctgtcctgcc acctcccttg 600 

ccccagggct tagtgccctg tigtggatcac ggctgcccca gggatgctgg ggaaggattc 660 

agcccaacaa atagattcgt tgacgtcctc tcaacgaatg tgtgctttta aaaatgtc-t 720 

taattttgca ctccatttca tacaacaggt gcacatttaa aaeattfcatt tttaaacagc 780 

tctgtcttct cagcactgca gactatttct cgctcacttg acaggcagac gtzgggacta 640 

tgttcttttc ctgccaagaa tccatcagcc ccccaatgcc tgcacctcga gtccagactc 900 

cttagcgttg cgttgggacg taattccccg ttagcatccc cagccccttc tccccctgtg 960 

agccagcctg ggttcttttc tttttccttt cttccctgtt ttccttcttt cttttttggg 1020 

tttgtaataa gtaatataca gtpttgtggt taagaaatca aaacaataaa gaagatacgg 1080 

cttgaggatt crtgctctgc actggtcctt ccgccaggac ctggtgcgtc ccccaagagg 1140 

gccttgccat gtgcgagctg agacacatgt tcacactttc ctcctttttt gtacaaaagg 1200 

tagcacatgg taaatagcgt ctgcacgctt ctcccctcgc tctgtcttcg ggactactcc i260 

acatccatga agaagcagca tccttattct cttgcacagt gggcagtggc tacactgtat 1320 

tcactcgtct ctgctgatgg gcactattgt agtggtgcaa ttagtaacct agtatacatg 1380 

cccttcagcg tgtgtactgg tgtatggatc tcacattccc agaggcngag tgaaagaata 1440 

aaggcttttt gacagrattc ctcaggtact cacacactcg ctctgtgagc cacgtgatca 1500 

tgggagcggc tccctgcaga ggtgtctgar gggagaggct gatgtccttg tctgatggtt 1560 

gctccaacct gagaatggtg agctgctctc ccaatgcaga ctttcatcac ttgcatctca 1620 

tgcgtgagat tgaactgctt ctcatttatg ttttcatctt tgtttgccca tattttagct 1680 

gggttgctaa tcttttaatg ttttctaggc gcttttcata aattagggag tttagctgta 1740 

gtgatgtgag ttgaaagtat tttttcaaat cgtcattgtc ttgactttcc tttttttttt 1800 

tcttttgcca cttaaaatat gcagttgaat ctcagcattt gggggcttct ggattttgag I860 

tcgtagttga aaaggtcttc ctattctgag gttatacagg aattcaccca tttttttcct 1920 

aatattttat ttccatttaa actttzgacc cacttggact gattttgttg tatggtgtca 1980 

agtatgggtc caagtttgct tttctgacct ggggatccta acatcgttca tcaaaatctc 2040 

catgttggcc gggcgtggtg gcrcaagcct gtaatcccag cactttggga ggctgaggtc 2100 

ggtggatcac aaggtcagga gatcgagacc accctggcta acacggtgaa atcctatctc 2160 

tactaaaaat acaacaaatt agccgggtgt ggtggcaggc acctgtagtc ccagctactc 2220 

aggaggctga ggcaggagaa tcgcttgaac ccgggaagcg gaggttgcag tgagctgaga 2280 

ttgcgccact gcactccagc ttgggtgaca gagcaagact ctgtctcaca aaaaaaaaaa 2340 

aaaaagggc 2349 



<210> 42 

<211> 1559 

<212> DNA 

<213> Homo sapiens 

<400> 42 

attcatgcca aaacataggc tttcagtgcc tattacatat ggctttcagc tctctctact 60 

gagggatgta ggagtttatt tctgaggtct gagcctcttt tcctttactt cctztactct 120 

ttcctaagcc ttctttataa aaactatgca tgttctattg ttttcctttt rgattccctt 180 

tcttttatta tccccagtag gagtgacttg taattctcat atgttagaaa ggc:aggtctc 240 

ctggttgaag aaaagatcca cccaagcaag tcagcagttt aataaatrtt tgagggggat 300 

ctcaaatgtg ggaaggattg ttatataaga caaccaaatg atgacatgag acaataaatg 360 

ctataggaat tatggaggaa taattagcta tttattttct tggttaggga agagata^ta 420 

ztagttgtag aagtaattac taacttctac atttuttatt gtggaaatca aaaatatata 4 BO 

tatgaaaata aaatgttata attgacttca gtgtcccata aaccagcttc caacaattac 550 

caaattgtga ccaatccttt acacacatgc acaggtgzcc ctccagtatc tgcggggcat 600 

tggttctagg accacttatg gataccaaca tctatggatg ctcaagtccc tgatataaaa 660 

tggtggacta tatgcatata acccgtgtac gtcccgtatt aaccaaatcc tccctagatc 720 
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acttacaaca 
aataaccccg 
atatcccagc 
gtgtgtgtgt 
tgtaaatatt 
cttatcctca 
tatttgaatc 
gttcccttta 
tcctagaatt 
tatttcctgt 
tagatcatca 
tgttttttag 
taaatttcca 
aaaaaaaaaa 



cgtaacacaa 
ccaaaaatgt 
ccccaagtcc 
gtgtgtgtgt 
ccctctttaa 
tatacccggt 
tgattcaatt 
ccacttcttt 
tcctgtagtc 
gagttggtag 
actttagatc 
aattacctct 
aattcagatc 
aaaaaaaaaa 



tctaaatgcc 
acttctctca 
cc^tcaaccc 
gtgtgtgtgt 
aaaaattatt 
gttcagat 1 1 
catgtactgt 
attaaattcc 
tgaattatgt 
ttagatctag 
atcaacttgg 
taaaattttg 
agcaaaacac 
aaaaaaaaaa 



acgttaacaa 
gtccagacgc 
cctggcacat 
gtgtgcatac 
atcacagcta 
tctagattgg 
aatgtttgat 
ttgtaatctg 
ggtaattgtt 
aagcttgact 
atcatttgtt 
atttaatttt 
aataaaatct 
aaaaaaaaaa 



ccgtcatact 
cttttttctt 
aggaggctga 
agacacacat 
aacaaattac 
ctcctaattt 
aacttaagta 
ttgtactaaa 
cacatgttcc 
aaattcagat 
tcattttgct 
ataafccatgt 
attcagagaa 
aaaaaaaaaa 



gtattaagcg 
tgtgtgtgga 
ctgcgtgtgt 
atctctgaaa 
cagtaattct 
ttttacagat 
ccctttatag 
tggattgtct 
agtgtcctct 
tttctctctt 
tttgatatgt 
aaaatgttta 
ggcaaaaaaa 
aaaaaaaaa 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1559 



<210> 43 

<211> 1766 

<212> DMA 

<213> Homo sapiens 



<400> 43 
cggcacgagg 
ttagaaaaat 
tgtgtctcct 
ttgttctgtg 
aaggatatgg 
gaagaaaaac 
gaagctgcta 
tcaattattg 
gttcatttct 
tgtgcattca 
tgtgtaagtc 
tctgttttcc 
gtttggatat 
ggggcccaat 
tcgtgatagt 
tcactgtgtc 
catgattgta 
ctgtaaagca 
cttttcttta 
aatacagatg 
gttgtaaaaa 
gcatattatt 
gagagagagc 
gcagctcatg 
ataaaaagtt 
gaagagtgtc 
aagcagaaag 
atttagagaa 
gacatgactg 
aaaaaaaaaa 



agcactgaag 
tgcaagtagt 
tgtagaatat 
tctgaagggc 
ttggatttca 
ctggctgctg 
agcttcctca 
ttttcagtgg 
gtgcttcagg 
tttttcctta 
tttsattctt 
agacccaacc 
ttgtccctgc 
gggaggtgtt 
gagttcttg^ 
ttgctcctgc 
aacttcctga 
tgcagaactg 
caaaatggaa 
gctctgccat 
taaaaggaaa 
aagaagacaa 
atatgttggc 
aaaaattctg 
tgatgattac 
aaatagaatt 
aagcatgaga 
gaaactcttg 
tgttcatgga 
actcgagggg 



tattcactac 
ggtatattgt 
tctctgatga 
ataaagcatg 
gcaatgagtg 
caggttggaa 
ggaaaattat 
ttctggctga 
aaactaagac 
aaacagcctt 
agagttacat 
ctgactgata 
ccaaatttca 
cggatcatgg 
aagatctggt 
tttcaccatg 
ggcctcccta 
ttagccaatt 
agaataaagg 
tagtgagaaa 
ttattaatat 
gcataagttt 
tttgctctgt 
cattgtttat 
aaaaaaaagg 
ttcaatctgt 
aagtctacaa 
attaaatact 
acataaagaa 
gggccc 



atgaagtata 
aattggcatg 
tactcactat 
gaaactacat 
ggctttgcat 
cactggcaac 
ttgtttcaat 
aaattggttc 
catcactggc 
cctaactttt 
ttctctracr 
tagtctccac 
tgttgaactc 
gggcagatcc 
cattttaaag 
tgaagtgcct 
gaagccaagc 
atacctcttt 
tatttcttta 
attgagacgc 
accgtctatt 
aacacaattt 
gaaactcaaa 
tatgtgtcag 
gaagcactaa 
gcataaggat 
tgacatctat 
ttttatgata 
attcctctga 



ttttgcactg 
cactatatga 
tatcccctct 
ttttcagact 
aaaatttgga 
aatagatacg 
atttctggca 
aattcttcta 
agtttttgtt 
actcccccag 
cagtatatcc 
gtgtttcaga 
taatccccag 
ctcacggctt 
tgtttggcac 
gccccagctt 
agatgccatc 
tctttacaaa 
tagcaatgca 
tttctctaga 
gtgatcattt 
ggcaatgaat 
tgaaatggtc 
gatcaacctt 
gtaatatagg 
acttaagcat 
gtcaatataa 
tgaacacaca 
ccaaagagaa 



tggacacaaa 

gcagagtcaa 

ctgctaagct 

ccattaccag 

agacgaaaga 

gagtttgcaa 

tatgggatca 

tctgagaatt 

gagggatcct 

cctctatggt 

tagtgcggct 

tggtgggata 

tgctgtaggt 

ggtgctgtct 

ttgtaccatt 

cacttttcac 

accagggttc 

ttaaaaacct 

agaacggcct 

tggcaaaaaa 

actaagttaa 

aaaattgaag 

acctgttcta 

aaattcagtt 

tacagagagg 

tttttaagga 

caagctggat 

catataatat 

ctggaaaaaa 



60 
120 
190 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1766 



<210> 44 

<211> 2572 

<212> DNA 

< 2 1 3 > Homo sapiens 
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<220> 

<221> SITE 

<222> (2527) 

<223> n equals a,c,g. 



<400> 44 

aattcggcac gagtctggac cttcttagmt tgcttgatat caggttgtct ttgtagccat 
tttgcttcac agtcacctggaacgccggga gcccccgctc atcccgatcc tctccttgta 
catgggcgca cttgtgcgct gcaccaccct gtgcctgggc tactacaaga acattcacga 
catcatccct gacagaagtg gcccggagct ggggggagat gcaacaataa gaaagatgct 
gagcttctgg tggcctttgg ctctaattct ggccacacag agaatcaatc ggcc^attgt 
caaccccttt gtttcccggg accttggtgg cagttctgca gccacagagg cagtggcgat 
^^r g r a f ataccctg tggtcacacg ccatacggct ggttgacgga aatccgtgct 
gtgtatcctg ctttcgacaa gaataacccc agcaacaaac tggtgagcac gagcaacaca 
gtcacggcag cccacatcaa gaagttcacc ttcgtctgca tggctctgtc actcacgctc 
tgtttcgtga tgttf.tggac acccaacgtg tctgagaaaa tcttgataga catcatcaaa 
gtggactttg cctt^gcagaf actctgtgtt gttcctttgc ggatcttctc cttcttccca 
gttccagtca cagtgagggc gcatctcacc gggtggctga tgacactgaa gaaaaccttc 
gtccttgccc ccagctcrgt gctgcggatc atcgtcctca tcgccagcct cgtggtccta 
ccctscc^gg gggtgcacgg tgcgaccctg ggcgtgggct ccctcctggc gggctttgta 
ggagaatcca ccatggtcgc catcgctgcg tgctatgtct accggaagca gaaaaagaag 
atggagaatg agtcggccac ggagggggaa gactctgcca tgacagacat gcctccgaca 
gaggaggtga cagacatcgt ggaaatgaga gaggagaatg aataaggcac gggacgccat 
ro^^H gggaca ^ tca gtcaggatga cacttcggca tcatctcttc ™ctccc«t 
^tcatatca attT^^ ttflt = ttfftt ttggtaatga aagaggccrt gatttaaagg 
.ttcgtgtca attctctagc azactgggta tgctcacact gacgggggga cctagtgaat 

S« ttaCt ?ttgCtatgt aaa — gaaacaacug act^catacc cctgccSac 1260 

£££^ aa aagacacagc tgcctcacgg ttgacgttg,: gtcctcctcc cctggacaat 1320 

f ^-caaagga ctgcagctgt gccatcgcgc ctcggtcacc ctgcacagta iaeo 

l??* * 9 t ^ tCCtSftCC «*ttcatcg ctcttaagaa tcaacaggtt aaaactcggc 1440 

ttaaatt?cc ^acatggc cgtacaaaga gatggagccc cggtggcctc 15ob 

ttaaatttcc cttccgccac ggagttcgaa accatctact ccacacatgc aggaggcggg 1560 

tggcacgctg cagcccggag tccccgctca cactgaggaa cggagacctg tgaccacagc 1620 

aggctgacag atggacagaa tctcccgtag aaaggtttgg tttgaaatgc cccgggggca 1680 
S«a^ aCggttgaat ^atagcattt caetctgcgt tctcctagat ctgagcaagc 
tgtcagttct cacccccacc gtgtatatac atgagctaac ttttttaaat tgccacaaaa 
gcgcatctcc agattccaga ccctgccgca tgacttttcc cgaaggcttg cttttccctc 

?^^^ Ctg aaggtt:gcat tagagcgagt cacatggagc atcctaactt tgcattttag 1920 

gaactgaa * c "taagtctc atccagcatt ctaatgccag gttgctgtag 1980 

^ ^ gaagtaga^a tattacctgg ttctgctatc cttagtcata actctgcgg? 20^0 
tttSr* g ^ aatgUC ta <=ggtactt ccctcccaca ccatacgata aagcaaglca 
9 atacca ^ t cactatgtgg tcctccctga aataacgcat tcgaaatcca 
' g ~? tatatttttc taagttttgg aaagcaggtz ttttccttta aaaaaattat 

SSS?^ cactaaattg atttagccag aattcctaga ctgaaagaac ctaaacaaaa 2280 

^?rt ? dagatacaaa tatatgctgt atatgttatg taatttattt taggctataa 2340 

tato^'! -"ttcgcat tttcaataaa atgcctetoa tacaatacgg tg^tgcttg 2400 

aacaa^ aa ! ^aaacgta ttgtatcaat gaacattgta ccttattggc 2460 

a?tt?cnca a tata™^ " aCttgCatt tgaatgtaag sctcagtaaa tgacagaact 2520 

actttcnca. tatgggtaac tgggggaata aatggggtca ctgggagtag gg 2 572 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



1740 
1800 
1860 



2100 
2160 
2220 



<210> 45 
<211> 526 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (106) 

<223> n equals a.t.g, or c. 
<220> 

<221> SITE 
<222> (484) 

<223> n equals a,t,g, or c 



<400> 45 

ctctgacagc ttcctctttg gccaagccct gcctctgtac agcctcgagt ggacagccag DU 

aggtcnagac tggagcccag agcccaagat ggagccccag ctgggncctg aggctgccgc 120 

cctccgccct ggctggctgg ccctgctgct gtgggtctca gccctgagct gttctttctc 180 

cttgccagct tcttcccttt cttctctggt gccccaagtc agaaccagct acaattttgg 240 

aaggactttc ctcggtcttg ataaatgcaa tgcctgcatc gggacatcta tttgcaagaa 300 
gttctttaaa gaagaaataa gatctgacaa ctggctggct tcccaccttq ggactgcctc . 360 

ccgattccct ttgstttcct atccttgcaa attactccar atgattycca aaatctggeg 420 
sccttgtgga ratcttttaa ctggtcagca awtwtcaaac gaaacctcca aacaggaaat 
cttntgcctc ctgcatccac ccccaaagaa cttgcacatt gacgtt 



60 



480 
526 



<210> 46 

<211> 1032 

<212> DMA. 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (974) 

<223> n equals a.t,g. or c 



<400> 46 

gtaaaattgt 

tattgggccc 

tcagagctgc 

gggtgtgtgg 

catcagggtt 

caagttcatg 

cctcaggggc 

ttcttttcgt 

gctgtccagt 

tatgttactc 

ggagttctga 

tccaaaaaaa 

gcagagacac 

gtggtgatca 

aaggggccac 

ggattgcgtg 

tgtgcctgtg 

gtcaaggctg 



caaacactta 
tggctgtcgc 
gagcatgcgg 
tgtgcattgc 
cactgcacac 
Gggaggcatc 
agtagggcag 
tgtatttctt: 
gtctttgtac 
ttttgaaata 
gaatatgaga 
ttgaaggcga 
aacactccct 
gaactggcag 
atgtggtggc 
aagccagcct 
ggtncagcaa 
ta 



ttttcaaaaa 
cactgagcat 
gtgctctttg 
agcaacattg 
gagcctcata 
ca cacagatg 
ccttcatctg 
taatatctta 
tgtacagttc 
caaatttctc 
agacttcttc 
acccgttgga 
ttstagcatg 
ttattctagg 
tcatgctr.ca 
gggcaacata 
cctiuggtggc 



taaaatgtgg 
tgctgaatgt 
tggagarttg 
gtggaacaga 
cacactacag 
gcagcagatc 
ggtgcgcttg 
ctcagcattt 
tcctgtgctg 
tttagaagt-.t 
ccacatagtt 
agctcagtcc 
tcccttggaa 
cctaaaacag 
atccccacac 
aaatttaaaa 
taagaccgaa 



tgggcggtta 
tgtca tgact 
ctctgtgcct 
tgtgtgtgcc 
aaaatatgct 
aggaaatgct 
gcagccctga 
tagtttctct 
ttggaatgga 
ttggaaaatc 
gcgcctgccc 
tgagtttaat 
gaaaataggc 
tttggcaata 
tttgagaggc 
attagccaga 
ggatcacttcr 



tgggtggagt 
tgtggacttc 
gtaggaaggc 
cccatgctgt 
cttgcaaccc 
cccagccagt 
actttatctc 
agttggatga 
cgtcatgggc 
tgttagagga 
gatgtatttt 
tgactacttt 
tggtgcttgg 
aaataagaaa 
rgaggcaggg 
catggtggca 
agcccaggaa 



60 
120 
180 
240 
30O 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1032 



<210* 47 
<211> 2680 
<212> DNA 

<213> Homo sapiens 
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<400> 47 

ggcacgagtc ttacagcaag tacttaacaa actmagattt ccataatcag atttatctca 60 

aaccgcaaaa aaaagctgat ttcattatgt taaactgata ggagattttt atgagagatg 120 

aaagtaagcg attttaactt tttaattttt ttaatttttg ctctgtttct cacacttgag 180 

gcttttctga aattcaccaa aagagttttg gcagr.tgtgg ggaatttacc agaacccccc 240 

attattaaaa caattggrtt cttatattaa aatcatcaga caggttttgt gtacacagac 300 

atttatgtta agccactgga tagaacLaaa ctgtgtgtag gaacccgct" tctctaagaa 360 

aggactccct tgagttctgt aggatatgtg. cttaaaggtt ttctgtgtca agctatttta 420 

tgaaataaaa rtttacaaac aatgaagctt cattctcaca cgttgtattt gtttaatgca 480 

gacttggcct atacgagttt ttgaaattaa gggttttttc cgtaaaacta gcttaggagt 540 

ttaatgagtc actgtgtctc tcacctctgr aggatgagca cagagatcac atttgagaag 600 

caggtgtggg tgttttctgc tggtcacgtc tcctctgcct ctcctctggc Ccagcagtta 660 

cttttcctca gtcatggggg ctgcattcag ctgcgatgcc agcattaggg ccaagctctc 720 

ccgcttctgt gacgttttat tacacgtact ctttatcagrt agtagctgcc attccctgag 780 

tgttccttcc accgtgcact gagaagggct gaggatttcc gaacatcaac ctcctctaac 840 

cctctgtatg ggaattagat ttfcfcatcatc ctgactgttg gggctttcta gagctcaggc 900 

aacttgcccc tgggcgggat ggagcctgcc tggggggagt gtgaatgagg cttctgttgt 960 

ctgtggcttg gctctctgtg gttggaaact tcagaagcag tagattccag gtgcaacatt 1020 

tggtgcccgg acaagttgta aaacc^gacc ttggggaggg cctr.cactgg cagccagtgt ioeo 

ccacccgtga gcccggtgag cacagacgtt agagccaggg ccggcgggct cagaggttgg 1140 

agccggggac cagtgggccc agaggtcgga gccggggacc agtgggctca gaggtcggag 1200 

ccatgtccct gccgtgaagc tctgagggtt ggtggtgaca tcggggcgat tgtggcatcg 1260 

cgattgcggt gtcttggctg cctttggccg gcgctgccac tgccctggtt tcatgacgta 1320 

ggcaccacat ggccctgtgt tgtctcactt acggctccac ttctggcttt ttcctacttg 1380 

ctcacccttg agataaactt ctatttcatt ttcatttcgg tttatatgtg ggttttcttc 1440 

caggtctgat gttaagccta taatattgca atgtgatgtt ttgaagttaa yytgtaatag 1500 

agccagtgaa ccaagggtLc acaccccagt gaaatacaaa tactcagaat tgagccactg 1560 

tgttgccata ctgattatgt aatgtgtgat taacaagtat aatgtgtcac tttcaacatc 1620 

agtttcatgc caaagttgca ttttattaga ttatttggga gttcactttg ggcccaaagg 1680 

ctcgtgtcta cataataata acctatgatt tttctttttg tctttgtttz attttttgtc 1740 

ttgtgttttt tgctttctag accatgccag agtaatctca gctttcttta gttactggat 1800 

cacacatatc cttcctgaga agagcagtga ctaaaatgga atatctcttt aagaacagct I860 

cctctttaac aaaaaaactt aaaagacaaa tgtgagatgg gcttagagtt agttctctgg 1920 

gaacttgaaa gacatttatg ccatattatt tattcacgtg tttgttcctg gtgggcaaga 1980 

tgccatctga ggcttcagat gagaaattgg ggtaaaatgg aaatttttca cttatttgca 2040 

attatatata tcttgaatta ctacataaaa cttgattctg tttczctac^ tattgcaaaa 2100 

attgaaaatg gacattctgt taagttaaat gtatagtttg aagctcazat atttttatga 2160 

agttttgaat caccttgrat ctgaaagtct ctgctttaag aatgctttct gggtattaaa 2220 

atgttctagt ttaagtagtt tgaatatagt tgagt^tttt ttctcttctc tactttgtga 2280 

atcatatcag gtacctgttt ttcctgtttc gactttcttt tctgtgatag aagcagtcgt 2340 

cagttcttgg tattactaag tgttaaaagc atcagtcagg ccgggtgcgg sggctcacgc 2 400 

ctgtaatcct ggcactttgg gagaccgagg caggcggatc acaacgtcag gagatcgaga 2460 

gcatccrggc caacacggtg aaaccccatc tctattaaaa atacaaaaaa ttagccgggc 2520 

gtggtggcgg gcgcctgtag tcccagccac tctggaggct gaggcaagag aatggcatga 2580 

acctgggagg cagagcttgc agtgagctga gatcgcacca ctgcactgca gcctgggcga 2640 

cagagcgaga ctccagctca aaataaaoaa aaaaaaaaaa 2680 



<210> 48 

<211> 1730 

<212> DNA 

<213> Homo sapiens 

<400> 48 



cccacgcgtc cgggggcctg tggcggagtc caagaggaac taggccgcct tcgggatggt 
gtggagcgct gctcctgccc cctgttgcct cctcggggtc ctggggctgg cccaggtgtt 120 
gggggcccaa gccgtgggcc cctggacggc tccagcgtgt ttgggggcag ctcaggctca 180 
gccctgcagg qcctgcaagg agagctctct gaggttatrc tcagcttcag ctccctcaat 



60 
120 



240 
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gactcactga 
ggggcaacca 
=atgctacag 
cagtgcccca 
actgtggctg 
^gggctgggc 
ctggtcgggg 
ctcctggagq 
gctgggcccc 
tcaccaggca 
ggagtggagg 
ttgccccggt 
tattatgatc 
cgcggtgctg 
agggcgtggc 
gtggccgaga 
caggccgggg 
ccgctcacca 
actggggtcc 
tggggcctcg 
cggcaccgcg 
ggggccgccc 
cctgcggagg 
cctgggctcc 
taaagacact 



atgagctcca 

aggaccgzat 

agagtgaaga 

gcttagaggg 

ggggactgca 

tccgggaaac 

gacaggcggg 

accgtctgca 

cagggcctcc 

aggacgggca 

gggcaccagc 

ctgaaccagg 

cagagacagg 

actgggcacc 

cggcgtagac 

gccagcccag 

acacggtctg 

rcttcagogg 

cgcccgacgt 

cctgagacgg 

cccagagcgg 

atgcagactt 

agccgatcct 

aggctccccc 

tggtttttct 



gaccactgtg 

ca'tttctgag 

gcgcttccga 

gcgattgggc 

gggcctgcgc 

caacaccacc 

cctgggcagg 

ccagcr.cagc 

tgggctgcag 

agagggcccc 

agcccctgcg 

cacggttccc 

cgtgttcaca 

ggcacgagaa 

tccgggLggc 

cccgggcacc 

cgtcgacctg 

gglicctgctc 

gtctacgccg 

ggcacctagc 

cctctcccca 

ttggcctggc 

cgcaccctcc 

gcgcgggcgc 

aaaaaaaaaa 



gagggccagg 

attaacaggc 

ggcctagagg 

cgtcttgagg 

gagggccttt 

agccagatgc 

cggctgggtg 

ctgaaggacc 

ggacccccag 

atcgggccac 

ccccaagcgg 

ttcgacagag 

gtcgccactg 

agtggaggcc 

tacgagcctg 

ctgggcgtct 

gtcatggggc 

tatggggacc 

gctgaagaga 

cctgggcgag 

cgcccggggc 

gcgatccccc 

gctccctcca 

cgcccaccgc 

aaaaaaaaaa 



gcgctgatct 

tgcagcagga 

agggacaagc 

gtgtctgtga 

ccagacacgt 

aggcagccrt 

cccttaacag 

tcactgggcc 

gccctgctgg 

caggccctca 

cattttcagc 

tcctgctcaa 

gctggacgct 

gtgctgtggc 

agggccrgga 

tcagcctcat 

agctggcgca 

cagagcttga 

cagcgggggc 

cgccgcaccc 

gcgccggctc 

aagaacccct 

ctggccctcc 

catactaaac 

aaggggccgg 



ggctgacctg 
ggccacagag 
acagcccggc 
acggttggac 
ggctgggctc 
gctggagaag 
ctccctgcag 
tgcaggagag 
acctccagga 
aggtgaacag 
cgctctgagt 
tgatggaggc 
acttgctgag 
cgctccaacc 
gaataagccg 
cctgccgctg 
ctcggaggag 
acacgcgtag 
ggcgggctcc 
gggcccgcag 
agggaggctc 
ccagggccgg 
aggtcgattc 
gatcgaggaa 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1730 



<210> 49 

<211> 1275 

<212> DNA 

<213> Homo sapiens 



<400> 49 

ggcacgagcc 

gtaczgtttg 

catccttct- 

ccaccccttt 

tacctgagga 

cactggtgca 

ttgggggtga 
ttgtccatga 
aggctttagt 
aggccacaga 
aatgcatagc 
cacagtgatg 
aaaatgcrga 
tatctattta 
acacatttct 
tcccagaaga 
ttaaaggcct 
accctacata 
aatgtaaaaa 
tataaggaga 
aagaataaaa 
aaaaaaaaaa 



agctcctctt 
gggaataata 
Cttctggtta 
gttccaacta 
ggacctttgc 
attctggaag 
agtcaccaag 
cttcaaaaca 
tgaaatggca 
tgtggtcact 
aagaggaaag 
ggtctaggca 
ctccaacatg 
caagcaaatc 
gaaaagagtg 
agaaggcatt 
tccagtagga 
ggtctaggct 
gtaaaaaata 
gaaaaaaatt 
aagttaagug 
aaaaa 



ccaagatgtc 
tgctttcaag 
tcttcattca 
atggatcaaa 
ctaggctatt 
gattacaaca 
gagtgtgcaa 
ttgctaagga 
aacaacttta 
gaggaattga 
gaaactacag 
gaagcttttg 
aaatgatttt 
tatgatgaat 
acatctcaag 
gttatcacag 
taaggngtgg 
aatgggtaat 
aaaataacat 
atcttcctct 
aaattgtttt 



ctccttaaac 
gntttgctta 
acgtcacact 
gagttatagc 
gctgcaaata 
tctatggcag 
a^gctatctg 
tgaggatgtt 
agctggatgt 
ctaatgacga 
gagoaaaaaa 
cagaccccaa 
cattagtaga 
aaaagaaaca 
aagagcccca 
gacatgacag 
agatggaagg 
ggtgtgtttg 
taatataggt 
gtatttgtgt 
taaagtaaaa 



tttgccagtg 

ttcttgataa 

gttgtctctc 

agcttttaag 

agaaagatcc 

catcaacaac 

ggaagagata 

gcaaaaaaaa 

ggaggaggat 

gttactggaa 

gaaaiagcccc 

caaactcctt 

gaggaatgtt 

aaccaaacaa 

ggcaggtcct 

ctccctcggt 

tgccgatttt 

tgtcctagtt 

aaaagtituat 

ttcaagctgc 

tagctacaat 



aaatggagaa 
tgctcctgca 
tctccaagca 
gcctgattac 
tttt taaaca 
cttggttaag 
ttgaagacat 
aatattcaca 
gatattgagg 
ccagaacaga 
caaggatatt 
aaaacgtttg 
catcgtgcat 
accaccatga 
tcaggaggta 
gtcattgccc 
tatgatcctg 
tttaaagaaa 
agaataagga 
gttactacaa 
aatctaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1275 



<210> 50 
<211> 1762 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (447) 

<223> n equals 



a , t , g, or c 



<4O0> 50 

ggtggcctag 

ccgctccacc 

gggaaccgcg 

gggtcgcctg 

agtcatttac 

caggagggat 

aaakttcat t 

tgaggagaaa 

catatcacaa 

ggtcacgtac 

gaatgatgac 

agcagttcct 

agaagaaaca 

gaatctggcc 

agttgtatgt 

aaagaagcaa 

ctacaatgtc 

gaatatttca 

tgacaacatg 

tggatttgtg 

gtctggatgg 

caacaatgga 

aaggtctatg 

ctgttggacc 

tatgagaagg 

ttggttgtac 

ggagcttgga 

gacagaaaat 

cggcaattct 

aaaaaaaaaa 



agatgctgct 
gccgtagcgc 
ctacaggccg 
ctgagtgggc 
ttccatga ca 
ersaggccagc 
gaaaacctc: 
caaagcaata 
tttaggaact 
catcagccat 
cggtgcaaca 
tctagagaag 
caggaagaag 
tacatcctaa 

tgggtttgga 
cacaccatct 
ataagaaaac 
ttccgagtgt 
gctgfcgaacc 
accaatgaca 
gtggaaaatg 
aaagaaatga 
aacaagctta 
cccacgtttt 
taccttgccc 
ctaactctta 
aaaccacctc 
ccagaatctt 
catatctgtc 
aaaaaac teg 



gccgcggttg 
ccgagtgccg 
tgctgctggc 
agecagtctg 
cttctcgaag 
tagtcagcat 
tgccatctga 
^gcacagnctg 
ggtatgtgga 
cgycacccgc 
tgaagaacaa 
ctgaaggtga 
atgccaaaaa 
fcccccagca- 
tcLgtagaaa 
ggccctctcc 
aaagegaage 
gttegggaga 
catcagaaag 
tttatgagtt 
aaatatatgg 
taagcaaaat 
gatcaggtcc 
ggctgtatcc 
aggtctggca 
agggacagag 
rgttttcctt 
ttcaaagccc 
t ttttcaaag 
ag 



cagttgtcgc 

gggggegcac 

cgtgctgctg 

ccggggaggg 

accgaacttt 

cgagtctgaa 

zggtgacttc 

ccaggacctt 

tgagccgtcc 

tggcaccgga 

tttcatttgc 

ggaaacagag 

aacatttaaa 

tccccttctc 

aagaaaaegg 

tcaccaggga 

tgacttagct 

agccactccc 

tgggtttgtg 

ctccccagac 

ttattaggac 

cctcttattt 

tgtggatgag 

tttatcccag 

catagtagag 

ctttacctgg 

gctctataca 

acatatggra 

aataaaaCca 



gcacgcctct 
ccgagccggg 
gtggggctgc 

acacagaggc 
gaggaageca 
gatgaacaga 
tggattgggc 
tatgcttgga 
tgcggcagcg 
ggcccctaca 
aaatattctg 
ctgacaacac 
gaaagtagag 
ctcctccttg 
gagcagecag 
aacagcccgg 
gagacccggc 
gatgacatgt 
actctggtga 
caaatgggga 
atataaaaaa 
tctataagga 
catgtggtcc 
ccagtcatcc 
tctcaataaa 
cagtgataaa 
gcagcacata 
gcacaggytg 
aataaagagc 



gcccgccagc 
ccatgaggcc 
gggccgcgac 
cttgttataa 
aagaagcctg 
aactgataga 
teaggaggeg 
ctgatggcag 
aggtctgegt 
tgttccagtg 
atgagaaacc 
ctgtacttcc 
aagctgeett 
tggtcaccac 
accctagcac 
acctagaggt 
cagacctgaa 
cttgtgacta 
gcgtggagag 
ggagtaagga 
ctgaaactga 
aaatacacag 
ccacgacctc 
agctcgacct 
tgtcacttgg 
gatgggctgt 
ttatcataca 
gcctgtgcat 
aggaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
- 600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1762 



<210> 51 

<211> 2059 

<212> DNA 

<213> Homo sapiens 



<40O> 51 

eggcacgagg 

gtgttcaaca 

ttattgctgt 

ctcatcacaa 

caaagencaa 

aactgttgtc 

acaacaagac 

accatcagca 

tatttccttg 

gecatttaac 

aaaactgtgt 

aactLUgaag 



tttggtctaa 
cagaagcaga 
agttttcatc 
cttttcacgc 
atgcatcaaa 
ctattgactg 
gtgggctgag 
eggaagctga 
gacttagaga 
ccacgccaga 
tgttctggtc 
caagtaggac 



gagfcgagcag 
gcagtggaag 
ttacttctcg 
tgtaagagag 
gagaaatcag 
gagaaccttc 
agttgaaagg 
gcagaacttt 
tgagaatgee 
gtattctggc 
ttataaccaa 
ctgtaaaata 



aactgcgttg 
gaggcacgea 
gtgtctgttt 
gcacaggagt 
aactgaaaag 
cagttccaac 
aactgttcag 
attattcagt 
aaaggtcagt 
ataagaatga 
gataaatggg 
cctggaacaa 



agaugggagc 
tccccagctg 
tattgeaagt 
gcacaagtta 
ttgetgaagg 
tgccacctcc 
ggatgggggc 
ttctggatag 
ggcgttgggt 
acccgacaac 
cc eggaatga 
cattgaacta 



acatttggee 
ataccttegg 
tgtttggtga 
gagcaccatg 
gagcacctgg 
ctccctactg 
ccatctgatg 
acggctttcc 
gggaccagac 
tytcagggag 
tgttccttgt 
gaaactcaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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aagtggtcct 

tgcgtcattg 

tttttatttt 

gataatgtgg 

agtatttcaa 

cgtgaataat 

ttcaaaaga t 

gcactttaaa 

atgecagtca 

agctctgaac 

tcttactcca 

agtccttcaa 

acattgttag 

ggtatcacag 

aatttcttaa 

aataatcaca 

tatagtttga 

gcatgaaaca 

tcttaaaatc 

aaaagcagta 

tgattgtgtg 

ttagctgaaa 

aaaaaaacta 



tgtgatggaa 

gaacacagaa 

gtttgatnca 

tttttgtatg 

ggtgtttact 

gcaatctct:t 

gggatattct 

gatttgaaac 

ttacaattoc 

tatgctgttc 

cccaacacgt 

ttccacagca 

itcatgacag 

gaagttactg 

ttgtttgaag 

ttttcaaccc 

actagcaagg 

ctaatfcgcaa 

ttcaattLgc 

tttatt-tgt 

gtggtggtgg 

tggtgctaaa 

gaggggggg 



agagaaaaga 

aacatgctgg 

ttcgagacaa 

gtgtttgacg 

tttcaattgg 

tgtcattttt 

aactggtagt 

cacattttta 

attgcatgag 

tacagatgga 

tr.ttgcaaag 

gaggaaataa 

acccaggtgt 

gggattactc 

taacattgta 

atttatacaa 

a>£gttattgt 

agggcagtca 

taatttttcc 

aaattagcta 

tggtgacgga 

taaatttgac 



aaaaccaatt 
ttcatacagc 
catgtgtgta 
gaaggaataa 
tgtgcactga 
tcccctttct 
ggtgcatcat 
ttgtttgatg 
gaaaattttt 
aaaaagtcca 
caagaagtct 
aatcccccag 
gcttcattag 
gaccrcatta 
ttcgtgtttg 
attgttaatg 
tttgacaacc 
catccaactt 
tggaaaccta 
ctgatttttt 
tgcgtgtagc 
tgtgtttgcc 



agaataaggc 

gtttttagtc 

tgtgtgtgtg 

cctttctttg 

atgcatgtat 

cagactctta 

ttttaaccca 

tttcaLLttc 

ccagaacaac 

aatgccttra 

ttgtaagaca 

aagccaaagg 

agataacata 

cttagctaac 

cattattaat 

tttctttaga 

agaaattatg 

taataaaata 

actttcctac 

gggtttgaaa 

tgtgttaaaa 

atactcacta 



agaatgtacg 

ataatggtct 

tgtgtgtgta 

ctttcttagt 

ggaagaatag 

gctcttaaaa 

aatattgcaa 

agacttttta 

agtgtggaat 

aaaatttact 

ccctaaacaa 

gctcaccttc 

cattcccttt 

gactggataa 

ttgaatagaa 

gctgtataac 

cttttctggt 

tggtggtctt 

tactactttt 

tactggtgtt 

gccgatactt 

aaaaaaaaaa 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
. IB60 
1920 
1980 
2040 
2059 



<210> 52 
<211> 3282 
<212> DNA 

<213> Homo sapiens 



<400> 52 
cccacgcgtc 
aaaaaagacc 
ctgggagctg 
gagttgtgct 
ttggcaggtt 
gtggtcaatc 
agggtggttt 

ctcacaaatg 
tatttccaaa 
gatataaatg 
aatacagctg 
aacggtatcc 
ggtgatgaag 
cagggagagc 
acctctactg 
gctctgtatg 
tgggcagaag 
tcagaaaata 
gaattgcttg 
ggaggccttt 
cctgaactga 
ctggctgttt 
attcaagaga 
acagaaggcg 
gacttgggga 
aataacaacg 
agccccgccc 
caggtcacct 



cgacttaaaa 
cctgggctac 
tggctgtaac 
tcccaagaca 
ctgggcttgg 
tggcaaagga 
ccgacgataa 
agaaac tgga 
ttttaatgga 
atcacgcgcc 
aagggacagc 
aaaactacac 
gcatgatata 
tcagcttaac 
tacgcatcgt 
agacccaggc 
atgtagactc 
ttcgaacaac 
attatgagtt 
ctgcaagatg 
tcgtatcacc 
ttaagattaa 
atctgccatt 
cgctggacag 
cacccaggc:t 
cccccgcctt 
tgcacatcgg 
actcgctgct 



gagaagcttt 
acggcgtagg 
caactaggaa 
aaggcaagtc 
acgttattcg 
tctgggacta 
caaacaatac 
ccgagagaag 
tgatcccttt 
agtatttcag 
atttagacta 
gatcagcccc 
tccagagcta 
cctcacagcg 
tgtcttggac 
tccagaaaac 
tggagtcaac 
ccttcaaatc 
agtaaattct 
nagggtttta 
attttccaac 
tgacagagac 
cctactaaaa 
agagatcaga 
gaaaaccgag 
cacccaaacc 
cagcgtcagc 
gccgccccaa 



agctgccaaa 
tgcagggttt 
ataacgtatg 
ctgtttcttt 
gtgactgagg 
gcagaggggg 
ctgctcctgg 
ctgtgcggcc 
cagatttacc 
gacaaagaaa 
gaaagagcac 
aactcttttt 
gtgttggaca 
ctggatggtg 
gtcaatgaca 
agccccattg 
gcggaagtat 
aatccttrtt 
tacaaaataa 
gtggaagcat 
tctgttgctg 
tctggagaaa 
ccttctgtgg 
gccgagtaca 
cacaacataa 
tcctacaccc 
gccacagaca 
gacccgcacc 



gattgggaaa 
cctactgctg 
cagcagctat 
ttcttttttg 
aaacagagaa 
agctggctgc 
attcacatac 
ctaaagagcc 
gggctgagct 
cagtcttaaa 
aggatccaga 
tccatattaa 
aagcactgga 
ggtctccatc 
atgccccaca 
ggttccttac 
cctatccatt 
ctggggaaat 
atatacaggc 
tggacaccaa 
agaattctcc 
atggaaagat 
agaattttta 
acatcactat 
ccgtgctggt 
tgttcgnccg 
gagactcggg 
tgcccctcgc 



gggaaaggac 
ttcttttatg 
ggctgtcaga 
gggagtgtcc 
aggatccttt 

aa ggggaacc 
cgggaat-tg 
ctgtatgctg 
gagagtcagg 
aatatcagaa 
tggaggact" 
tattagtggc 
tcgggaggag 
caggtcrggg 
gctcgcccag 
tgttaaggta 
ttttgatgcc 
ctttctcaga 
aatggacggt 
tgacaatccc 
tgagacgccg 
ggtttgctac 
catcctaatt 
caccgtcact 
ctccgacgtc 
cgagaacaac 
caccaacgcc 
ctccctggtc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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-ccatcaacg cggacaacgg ccacctgttc gccctcaggt cgctggacta cgaggccctg 1740 

caggctttcg agttccgcgt gggcgccaca gaccgcggct cccccgcgct gaacagcgag 1B0O 

gcgctgggtg cgcgtgctgg tgctggacgc caacgacaac tcgcccttcg tgctgtaccc I860 

gctgcagaac ggctccgcgc cctgcaccga gctggtgccc cgggcggccg agccgggcta 1920 

cctggtgacc aaggtggtgg cggtggacgg cgactcgggc cagaacgcct ggctgtcgta 1980 

ccagctgctc aaggccacgg agcccgggct gttcggtgtg tgggcgcaca atggggaggt 2040 

gcgcaccgcc aggctgctga gcgagcgcga cgcagccaag cacaggctgg tggtgcttgt 2100 

caaggacaat ggcgagcctc ctcgctcggc caccgccacg ctgcacgtgc tcctggtgga 2160 

cggcttctcc cagccctacc cgcctctccc ggaggcggcc ccggcccagg cccaggccga 2220 

cttgcccacc gtctacctgg tggtggcgtt ggcctcggtg tcttcgctct tcctcctctc 2280 

ggtgctcctg ttcgtggcgg tgcggccgtg caggaggagc agggcggcct cggtgggtcg 2340 

ctgctcggtg cccgagggtc cttttccagg gcatctggtg gacgtgaogg gcgctgagac 2400 

cctgtcccag agctaccagt atgaggtgtg tctgacggga ggccccggga ccagtgagtt 2460 

caagttcttg aaaccagtta tttcggataw tcaggcacag ggccctggga ggaagggtga 2S20 

agaaaattcc accttccgaa atagcttcgg atttaatatt cagtaaagtd cgtttttagt 2580 

; ttcatatact tttggtgtgt tacatagcca tgtttctatt agtttacttt taaatctcaa 3640 

atttaagtta ttatgcaact tcaagcatta ttttcaagta gtatacccct gtggttttac 2700 

aatgtttcat catttttttg cattaataac aactgggttt aatttaatga gtattttctt 2760 

ctaaatgata gtgttaaggt tttaattctt cccaaccgcc caaggaatta attactatta 2820 

tatctcatta cagaaacccg aggttttgat tcatttcaga gcttgcatcr catgattcta 2880 

atcacttctg tctatagtgt acttgctcta tttaagaagg catatctaca tttccaaact 2940 

catcctaaca ttctaratat f.cgtgt t tga aaaccangtc atttatttct acatcatgta 3000 

tttaaaaaga aatatttctc tactactatg ctcatgacaa aatgaaacaa agcatattgt 3060 

gagcaatact gaacatcaaL aataccctta gtttatacac ttattatttt atctttaagc 3120 

atgctacttt tacctggcca atattttctt atgttaactt ttgctgatgt ataaaacaga 3180 

ctatgcctta taattgaaat aaaattataa tctgcctgaa aatgaataaa aataaaacat 3240 

tttgaaattt gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3282 

<210> 53 

<211> I860 

<212> DNA 

<213> Homo sapiens 



<400> 53 

aattcggcac gagcagcagg gagaagagga tgatgagcat gccaggcccc tggccgagte 
cctgctcctg gccattgctg acctgctctt ctgcccggac ttcacggttc agagccaccg 
gaggagcact gtggactcgg cagaggacgt ccactccctg gacagctgtg aatacacctg 
ggaggctggt gtgggcttcg ctcactcccc ccagcctaac cacatccacg atatgaaccg 
gatggagccg ctgaaactgc tgctgacatg ctuctccgag gccatgtacc rgcccccagc 
tccggaaagt ggcagcacca acccatgggt tcagttcttt tgttccacgg agaacagaca 
tgccctgccc c^cttcacct ccctcctcaa caccgtgtgt gcctatgacc ctg^gggcta 
cgggatcccc tacaaccacc tgctcttctc tgactaccgg gaacccctgg tggaggaggc 480 
tgcccaggtg ctcattgtca ctttggacca cgacagtgcc agcagtgcca gccccactgt 540 
ggacggcacc accactggca ccgccatgga tgatgccgat cctccaggcc ctgagaacct 600 
gtttgtgaac tacctgtccc gcatccatcg tgaggaggac ttccagttca ccctcaaggg 
tatagcccgg ctgctgtcca accccctgct ccagacctac ctgcctaact ccaccaagaa 
agatccagtt ccaccaggag ctgctagttc tcttctggaa gctctgcgac ttcaacaaga 
aattcctctt cttcgtgctg aagagcagcg acgtcctaga catccttgtc cccatcctct 
tcttccccaa cgatgcccgg gccgatcagt gtaagaccag ggtgggggtg ggatgctggg 
ggctttcctg gcggcgaggt ggagggctgg ctatctgccc tgactcccag gagctcagcc 
tggcactctg accccc-agg agggaggcct ccaaaatggt cactgcctgg acccctcctL 1020 
ccccctccca cccgagtgac ccctcccacg ccacctgccg cagctcgggt gggcctgatg 1080 
cacattggtg tcttcatctt gctgcttctg agcggggagc ggaacttcgg ggtgcggctg 1140 
aacaaaccct actcaatccg cgtgcccatg gacatcccag tcttcacagg gacccacgcc 1200 
gacctgctca -tgtggtgtt ccacaagatc arcaccagcg ggcaccagcg gttacagccc 1260 
ctcttcgact gcctgctcac catcgcggtc aacggtaggg gcccggagcc tgcactcccc 1320 
gcgctcacca ccaggtggtg tcagccacag gccaacctgc cacttgcccc aggtttcaag 1380 
cccctccagc ccacagaaag aacgcaagct tgtgtaccct tcatagtagc cacat.aaaa 1440 



60 
120 
180 
240 
300 
360 
420 
480 
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780 
840 
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960 
1020 
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aaaaagggtg acaacaattg aataatacgt tttatctatc atcatctcaa catgtagcca 1500 

atatgaaaat tactaatgag atatctaata ttcccttctc acactcagtc ttgtcgccca 1560 

ggctggagtg cagtggcacc azctcagctc actgcaacct ccgccccttg ggttcaagca 1620 

attctcatgc ctcagcctcs tgagtaggtg ggattacagg cgtgtgccac cacacctggc 1680 

taatttttgt atctttagta gagacggggt tttgccatgt tgtccaggct ggtctcgaac 1740 

tcctgacccc agatgatang cccacctcag cctcccaaag tgczgggatt acaggcatga 1800 

gccactgcat ctggcctttt taaaaaaaaa aaaaaaaaaa actcgagggg gggcccggta I860 



<210> 54 

<211> 770 

<212> DNA 

<213> Homo sapiens 

<400> 54 



aattcggcac gagccgaggg gcgaatccat acatccgtcg tgtggactcg ccctgacatc 60 

ccctcaacgc ttagactcgg cattcttctc attgcctgat gccgtctcat gccttctttg 120 

cgtttcctcg ctctggccct cctgctggcc atccttcccg cacttcccaa cgcccacgcc 180 

gcgccgggga ttggcggcct gatcggcggc ggctcccagg cctcagccaa ggaagagccc 240 

cagagcaacg cgcaacccag cgccgatgag cgcaaacaac gaccgc-cag ccaggccgag 300 

gaaacccggc agcgcctgac cgatctcaag gccgaactcg ccggcgcacc caaggagatc ' 360 

agcgaggccc agcgtacgct ctccaaactg gtgagcgagg acaacagcga tctgcccgag 420 

cgcctctcca agctttcggt gccggtactg gagcaacgcc tggcggcccg cgtggacgag 480 

ctcgccctct ggcaacaagc gctcagcgcg gccaacagca tgctcatcag cgcgcagacc 

cggcccgagc gcgcccaggc cgatatcagc aagaaccagt tgcgcatcga cgagatcaac 

ggcccgctga aaagcggtcg ggagaacaac aagccgctga cggacgaacg tcgcgcgctg O ou 

ctcgagagta cttctagagc ggccgcgggc ccatcgattt tccacccggg tggggtacca 720 

ggtaagtgta cccaactcgc cctatagtga gtcgtattac aattcactgg 770 



540 
500 
660 



<210> 55 

<211> 1093 

<212> DMA 

<2X3> Homo sapiens 



<400> 55 

cagattcggc acgaggtttt ttitaagatt tattttttaa aaattatttt tgtggacttg 
ggtatcaatg atggcaccta ctcttgggaa nctgtagctg tgctttgaga attgccatcg 
gtcacgtgtt gcaccgttct ctgtatgttt acgtcctttg gaccggctLc Lcccaggatt 
ettttctgtt tttgttcttt tgatttgggc tttatctctt cctgtgtact gtactatatt 
gtaaaaggga ttttagcaga gactttagtc tttggggcaa gaggagaaca ggaatgccgg 300 
gctgtttacc ttaggcggag aacccatctt cagacctttg gactattttc tttcaactgc 360 
agtgtataga aaaaccaaac tacgacctca gagcagagta ttaatgaaaa gcacaaaaaa 420 
aggaactaag ttcagcgagg ggtgggggga ggggggagat ttLtcttttg aaaaataatg 
actcttagga catttgtttt tcagctcaag tgctcctcag cactgtcttg tctcccaata 
taccaaccca ctggcacatt tttctcttgt tttctctctc cgattttgct ctgtctcctc 
agttaagtgt ttccttcctt tgtgcccccc gctggcgacc ctctgcttcc ctcuctcttt G60 
ccctttggca gctgcaatac acagtgttat tttggggaaa taaatctagc aaagcctcgc 720 
cttccatgcc gagcgtcctc ttggctctga gagggaaagg tctgtccttg ggatgctctc 780 
tggtcttttt tccccccaag tctttctctt tcccatcata cccztccctg cccaccttgt 840 
tttctg-tcc ccttttatta ggaattccca agtqaatttt attaatgtgg gagtggaaca 900 
gacgctaaaa gctatccagg atttr.gr.ttc tgtttgtttt aaattttgtg gttccttccc 960 
tttcctcccc ctcccatgcg taagacgttc tgtgtaacct ccat-aaatt tggtacaaaa 1020 
ccactcgcca gagctgtggt gtcagaaaaa taaaatatat tgcttctcatn aaaaaaaaaa 1080 
aaaaaaaact cga 



60 
120 
180 
240 



480 
540 
600 



1093 



<210> 56 
<211> 632 
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PCT/US99/I7130 



<212> DNA 

<213> Homo sapiens £ 
<220> 

<221> SITE 
<222> (29) 

<223> n equals a,t,g, cr c 



<220> 

<221> SITE 
<222> (46) 

<223> n equals a,t,g, 
<220> 

<221> SITE 
<222> (94) 

<223> n equals a,t,g, 
<220> 

<22l> SITE 
<222> (162) 
<223> n equals a,t,g. 



<4O0> 56 

cgcaattaat 

cggctcgtat 

catgattacg 

cgcgg-ggcg 

ctatgtatat 

gtctattttt 

gcctgcctca 

gtagcttnaa 

cttaaaaaat 

agatccttat 

aataaaaaaa 



gtgagtcagc 
gttgtgtgga 
ccaagctcga 
gccgctctag 
atgtttaata 
cttttttttt 
atgtatattt 
atgaataggt 
actttgtttt 
atatttatgt 
aaaaaaaaaa 



or 



tcactcatna 
attgtgacgg 
aattaaccct 
aactagtgga 
tctgtctttt 
aatgctaccc 
atxtcttttg 
gagcaaaatc 
gqtacatttg 
taaagcaata 
gctcgagggg 



ggcaccccag 
atancaattt 
cactaaaggg 
tcccccgggc 
gaaatgcaga 
agggaaatat 
aaacaaaaag 
tatatgggat 
gttgtgcttg 
ttttattatc 

gg 



gctttncacc 
cacacaggaa 
ancaaaagct 
tgcaggaatt 
aatagtttaa 
tttcatatca 
gttctggaaa 
gtaatttttt 
tggggaaaat 
tacataaaac 



ttatgctttc 
acagctatgc 
ggagctccac 
cggcacgaga 
atgtttcttt 
tttttaagtg 
ctgtttttct 
tgttcagtct 
aaaaacgcag 
agauatgcac 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
632 



<210> 57 
<211> 2687 
<212> DNA 

<213> Hoiao sapiens 
<220> 

<221> SITE 
<222> (1614) 

<223> n equals a ( t r g, or 



<400> 57 

gtacaccatg 

gttcctcttg 

cccagtggtg 

gacagtggtg 

cctggaaccg 

acaacaaaac 

ttgggaagac 

tccacaccat 

gggctcccta 

gcgagatgat 

gtatggrrcaa 



ggcctccacc 
ctgctgctaa 
ctggtccctg 
cactacctct 
ctgctgcctg 
atccagggcc 
cttctcactg 

ggtggagagc 

tgactggcgc 
cgaggagatg 
catgtacacg 



tccgccccta 
tgctgctcgc 
gcgacttggg 
gctccaagaa 
tgcatcattg 
acccagtttc 
gagttcctgg 
cttgtgggct 
cgagccccaa 
taccagctgt 
ctctactttc 



ccgtgtgggg 
ggacccagcg 
taaccaactc 
gaccgaaagc 
actgctggat 
ctgatggtgt 
accccagcaa 
ggggccacac 
atgaaaacgg 
atgggggccc 
tgcagcggca 



ctgctcccgg 
ctcccggccg 
gaagccaagc 
tacttcacaa 
tgacaatatc 
ggatgtacgt 
aagcagcgtg 
acggggtgag 
gccctacttc 
cgtggtgctg 
gccgcaggcc 



atggcctcct 
gacgtcaccc 
tggacaagcc 
tctggctgaa 
aggctggttt 
gtccctggct 
ggttcctatt 
gatgtccgag 
ctggccctcc 
gttgcccaca 
tggaargaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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720 
780 
B40 
900 
960 
1020 
1080 
1140 



agtatatccg ggccttcgtg tcactgggtg cgccctgggg gggcgtggcc aagaccctgc 
gcgtcctggc tncaggagac aacaaccgga tcccagtcat cgggcccctg aagatccggg 
agcagcagcg gtcagctgtc tccaccagct ggctgctgcc ctacaactac acatggtcac 
ctgagaaggt gttcgtgcag acacccacaa tcaac.acac actgcgggac raccgcaagt 
tcttccagga catcggcttt gaagatggct ggctcacgcg gcaggacaca gaagggctgg 
tggaagccac gatgccacct ggcgtgcagc tgcactgcct ctatgqtact ggcgtcccca 
caccagactc cttctactat gagagcttcc ctgaccgtga ccctaaaatc tgctttggtg 
acggcgatgg tactgtgaac ttgaagagtg ccctgcagtg ccaggcctgg cagagccgcc 
aggagcacca agtgttgctg caggagctgc caggcagcga gcacatcgag atgctggcca 1200 
acgccaccac cctggcctat ctgaaacgtg tgctccttgg gccctgactc ctgtgccaca 1260 
ggactcccgt ggctcggccg cggacctgct gttggcctct ggggctgtca tggcccacgc 1320 
gttttgcaaa gtttgtgact caccattcaa ggccccgagt cttggactgt gaagcatctg 
ccatggggaa gtgctgtttg ttatcctttc tctgtggcag tgaagaagga agaaatgaga 
gtctagactc aagggacact ggatggcaag aatgctgctg atggtggaac tgctgtracc 
ttaggoccgg ctccacaggg tggactggct gggccctggt cccagtccct gcctggggcc 
atgtgtcccc cctattcctg tgggcttttc atacttgcct actgggccct ggcncsgcag 1620 
ccttcctatg agggatgtta atgggctgtg gccctgtacc cagaggtccc agggatcggc 1680 
tcctggcccc tcgggtgacc cttcccacac accagccaca gataggcctg ccactggtca 1740 
tgggtagcta gagctgctgg cttccctgtg gcttagctgg cggccagcct ga=tggcttc 
ctgggcgagc ctagtagctc ctgcaggcag gggcagtttg ttgcgttctt cgtggttccc 
aggccctggg acatctcacc ccactcctac ctcccttacc accaggagca ttcaagctct 
ggattgggca gcagatgtgc ccccagtccc gcagctgtgt tccaggggcc ctgat~tcct 
cggatgtgct attggcccca ggactgaagc tgcctccctt caccctggga ctgtggttcc 2040 
aaggatgaga gcaggggttg gagccatggc ctnctgggaa cctatggaga aagggaatcc 2100 
aaggaagcag ccaaggctgc tcgcagcttc cctgagctgc acctcttgct aaccccacca 2160 
tcacactgcc accctgccct agggtctcac tagtaccaag tgggtcagca cagggctgag 
gatggggctc ctatccaccc tggccagcac ecagctragr gctgggacta gcccagaaac 
ttgaatggga ccctgagaga gccaggggtc ccctgaggcc cccctagggg ctttctgtct 
gccccagggt gctccatgga tctccctgtg gcagcaggca tggagagtca gggctgcctt 
catggcagta ggctctaagt gggtgactgg ccacaggccg agaaaagggt acagcctcta 
ggtggggtcc ccaaagacgc cttcasgctg gactgagctg ctctcccaca gggtttctgt 
gcagctggat tttctctgtt gcatacatgc ctggcatctg tctccccttg ttcctgagtcr 2580 
gccccacatg gggctctgag caggctgtat ctggattctg gcaataaaag tactcMgat 2640 
gctgtaaaaa aaaaaaaaaa aaaaaaaaaa. aaaaaaaaag ggcggcc 2 687 



1380 
1440 
1500 
1560 
1620 



1800 
1860 
1920 
1980 
2040 



2220 
2280 
2340 
2400 
2460 
2520 
2580 



<210> 56 
<211> 619 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (526) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (619) 

<223> n equals a,t,g, or c 
<400> 58 

tcatcagctt tcacctggai catcgtgaca gcctcctact gcttctctat cargtggcca 
gagctatctt cctaaaatgc actgcatagt tgatcaagtc actctctggc ctaaaacctt 
ccttggctcc ctgctgccct caggataaag tctggacccc tcagcatggc ttgtgagact 180 
catggtgtcc ttgtccctgc tcacctctct ggtctcatca cttgccttct tgcattctga 240 
gtcccagcct cctgtatcca gagatgcagt ggctctccat tgccactctg attcctcctt 
ccttttggtc acagagaaag ggtactttct ctgtcaaatc tcaacttaga cctoacttcc 
tccaaggagc tttggctata ctctctcctc ccgaccccca ccctggcata ctacacagar 



60 
120 



300 
360 
420 
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cactctgggc tcacttgcct gcctaatggt catctcccca gtagac.tgta agctccttga 

gggcaaggat tgtgztggaa tttttgtact aacagtgcct ggcctngtgc ctggcaccta 

gaaagcactc aataaatgtt tgtttaatga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaan 



<210> 59 
<21i> 1378 
<212> DMA 

<213> Homo sapiens 



<400> 59 

tcgacttctg ggatgtagat tctcctttat tgaattcttc attttcactc gcagtatgat 60 

ttattttttt cacactgtac tttacactgc atatgtttct cttataacga gggtcacttt 120 

cttcctgccc aggattggaa tcaattacag ctatgcaatg ttttttgttt agcatcttct 180 

taattacagg cttggcagaa gagttttgtg aacaactctc catttcactg gctgaagaag 240 

aaatacaact gtcctctaca gt^aacatr tctgtatgac agent tttca tggtaatgag 300 

gattaggttc atactttggt ttlftccaaac eggggaaaca aacaacagcc aaaaaccagt 360 

gtgcagctLc attaaggggt acaaaaataa aatccttctc aaaaatatct acgtgccggg 420 

tccatgtttt cactctccca tgccgttttt gctgtattga casrattagtt: gtttcatgat " «0 

ctctcctc-c tctctgatta aggegtttat agaaaaaaga actgaatata tgaattcggt 540 

cagcgtcttc cttcttcagt ttttcaagca ccaagtattt caaataaaag tctataataa 600 

catcatttaa aaattctcct tcatttagac agtgcaggtc ctcattggta acagagatgc 660 

ctcccttagc tggaggcggt ggatatacta tcaacttttc tactgggeca atgaagatgg 720 

tgtggttttc tccagtttct tcttcttcat caaaaaactg aaattcttgt ttgcttctaa 780 

gttgtatttt agatt=aaat gatacagttt taatttcgtt ttccttttgc ccacaacttc 840 

etttgatget ctcttcatag gttcttgtac aggcaacaag tetgecatta gcttcttcaa 900 

agggaatttt cgcaaaaaaa ttggagatgt tattctzcac accaatttca ttaatgatac 960 

tttcaaatac catatctgcc ggaggatcaa ggccattttg aaaaattaaa attatatatt 1020 

gttcttctaa atttgrtaat ttatttactc ctttacaatc attccaaact ttatcctcct 1080 

tattcatttg cagttggatg ctcagctttt gataaactgc tggaattget tgaagaaaca 1140 

ctacaggtaa tttteggaca ttacaccatt cacatttagt tagatcagag gcatttaata 1200 

taatctctac aggatcatgg tctggttcgt ctagctgtat cttgacaaaa cctaaacaaa 1260 

aaattacagg ctctattaac ageeggaaga gtgttcctac tcgtcgacgc ggccgcgaat 1320 

tcccgggtcg acgagctcac tagteggegg ccgctctaga ggatcccccg aggggece 1379 



<210> 60 

<211> 1126 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (21) 

<223> n equals a,t.g. or c 
<220> 

<221> SITE 
<222> (35) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (49) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (99) 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 
<222> (1012) 

<223> n equals a,t,g r or c 
<400> 60 



aatttcacnc 


aggaaccagc 


60 


aaaggaccaa 


aagtgaagtt 


120 


ccgggctgca 


ggaattcggc 


180 


gcgctggcag 


cagcggccaa 


240 


acgccgctcg 


cgctgctgcc 


300 


tttgtcatcc 


tgctcatggc 


360 


gcgctgctgc 


ccat:cgtcct 


420 


ccccagtact 


ccctcgacac 


480 


attgaggagt 


ggaacctgca 


540 


cagccggcca 


ggctcatcct 


-600 


agcaacaccg 


ccf.ccactgc 


660 


tttggccaga 


aggaggttcg 


720 


gaacccaata 


ccttcctctc 


780 


cttggccaga 


taactgagtc 


840 


cgggttctgt ccttcctgcc 


900 


aggagccaca tcttgagaga 


960 


tcagtgaaca 


cnacaagcat 


1020 


ttaaaatatg aattaaatta 


1080 


ctcgag 




1126 



<210> 61 

<211> 2078 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (337) 
<223> n equals a,t,g, 

<220> 

<221> SITE 
<222> (492) 



<223> n equals a,t,g, or c 
<400> 61 

^!r\ ttC aatagatt ^ t caataaatta tcaaattatg gaaatatcct ttagattcct 60 
gtgataatgc tttgtgcatg ttttctatta Ltcagaaaat gaaaacagca gtcatttgac ill 
™" «tg«caccag ataatattct: gcaaggatct tccaaacagg gagaaaSgc ill 
TattTSt* atCttt ^ at -tctacattg tgtcaaatgt tatgtgcttt atitcatttt III 
catacttata acagctgtat atcacagttt ttatxateca cattatagag gcaaggcact 300 
Tatlltli: aCtt9a ^ taa ttggcttaca cagcaantgt agttgaaacl gaagccagga lei 
ctgtctgatt cccaagacag tgttcttgta ccgcaggtaa gcctggtgtt gctgggktEc 420 
aatS^e ^attgcag aaatacggag atttatttac tcaggtgaat gatgaagcag 
ate™ ^cttttta aattatkgat gggkeckgag ttaagctagt ccaccaftga 
gtctcaaagg gtgaagaact ggcctctgca gttttaaaac aaagcagcag tttect-ac 
ttgtagatat tatatttaat ttaccttgea tttccaatag catitaaatat tagaagegea 
taagcttttt etttttatge ccaggggata tggcttaaca ctgagaaaca cttgagaaac 
actgccacac tctaaggaac aaccaataat aagaaaaatt atttcatata aaaggatgra 



480 
540 
600 
660 
720 
780 
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gcttgcatat 
ggggtgacct 

ggagcaatgt 
tttttcttta 
atgcacaca^ 
cttcaatctt 
tgcagaactg 
aggaagttag 
ggt^gcantc 
agtgcttaat 
gcacatcaga 
gattttgtgc 
gtcctttgct 
atattccaca 
gtccgagata 
gaagccagca 
ggccaatatc 
tccagcagca 
tggtgaaacc 
tgtag-ccca 
cttacagtga 
tctcaaaaaa 



aaaatgctga 

tcatgctctg 

gctccataac 

aagg^gtgct 

acatatacmt 

tcaggaaagt 

ctgaaattgg 

tccagctgtg 

ttaggtcaca 

ttaaggatta 

taatgagtta 

atagaaagat 

aaaagccatt 

tttaacaggt 

ttctagttta 

ttaccctgat 

gctgatgaac 

tgaggcgggt 

ccatccctac 

gctactgggg 

gccgagattg 

aaaaaaaaaa 



attcaactga 

gactcttaaa 

tagggttgcc 

tttcaaaaca 

catgcatgfca 

ttgtaaaawa 

ctcagatgaa 

ttttgcgagg 

tggcaattaa 

ttcaatgcag 

actgcatgta 

gtcttagtca 

atataaactg 

gataattatt 

aaccaagtaa 

actagaactg 

j»tcaatgtga 

ggatcacaag 

taaaaataca 

aggctgaggc 

caccactgca 

aaaaaaaaaa 



agttcattat 
ggatccagtg 
taagtacat.c 
tgatgttatt 
tcatatatgt 
tagccctaag 
aaggaaagag 
catgaaccag 
attccagcta 
tgaataaatt 
acttgaaaac 
atgtaaaaat 
aagtcactta 
gaaaatatta 
ctaatactgt 
ggaagagaca 
aaatcttcaa 
gtcaggagat 
aaaaattagc 
aggagaatgg 
ctccagcctg 
aactcgta 



gctgccttta 
tcggacacag 
ccctctcccg 
tcatatacmt 
gtgtgtatgt 
tagaattatg 
caaaaatcta 
gcngaaccag 
tagaatattg 
tatatcattc 
cttaaatatt 
gccagtgatt 
tccctgcttc 
gttsgcctca 
aaaagaagta 
cacacaaaaa 
taaaatactg 
cgagaccatc 
cgggcatggt 
cgtgaacctg 
ggtgacagag 



tgtgagctca 
ctctgccatt 
ktatctatag 
atatatatat 
atgtatctat 
gggaaacagt 
ggttattcac 
agcaagaaca 
aagacatgta 
ccaaaagtat 
taactgaaag 
tgtgtagcat 
caatgtgata 
ctaatattgt 
ctccttttat 
gaaaacttca 
gcaaaccgaa 
ctggccaaca 
gatgggcacc 
ggaggcggaa 
caagactctg 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
*1920 
1980 
2040 
2078 



<210> 62 
<211> 762 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (10> 
<223> n equals 



a,t,g, or c 



<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, oar c 
<220> 

<221> SITE 
<222> (42) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (219) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (747) 

<223> n equals a,t«g, or c 
<400> 62 

tS~f?H n a "^ ga9aaa ^ c ^^agag C g cccaatcgca anccgcctct ccccgcgcgt 
tgcccgattc attaatgcag ctggcacgac aggtttcccg actgaaaagc ggccag^gag 
cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt acactttata 
ctcccggccc gtatgttgtg tggaattgtg agcggaLnc aatttcacac aggaaacagc 



60 
120 
180 
240 
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tatgaccatg attacgccaa gctcgaaatt aaccc^cact aaagggamca aaagctggag 300 

ctccaccgcg gtggcggccg ctctagaact agtggatccc ccgggctgca ggaattcggc 360 

acgagcacaa acgcctatac aaaacttttt tccatgtcct caatactttg tatcagatta 420 

ttcatatttc tcatggtttg gtttcttttt ctcgtcttca tttttctgaa ggtaaaaggg 480 

gattttctcc ccccttttct gatctgtaac ttgttttgta tttggatgat tacaggcgtg 540 

agccaccgcc tccagccaca aatacttttt agtcgccata agcataatca agaaatcatt 600 

ttgcaaatgg taagcttttc atgttgtgta ttctttccca tgatcagaga ggtaaaaagt 660 

gscctgggat gcataaagat gtcgtagacra tmtgcaatct gccacattat ttctttaaaa 720 

aaaaaaaara aoaactcgag ggggggnacg gaacccaggg eg 762 

<210> 63 
<211> 1094 
<212> DNA 

<213> Homo sapiens 



<400> 63 



tcggcacgag gtcaatcaag tgaaatatca tgtaaactgt ccagcagctt tgaaagtaga 60 

gaatgaacaa ggccccttcc ccacccaccc tgtggaaagc ccgtctggtt tggtgtcctc * 120 

ctggacagcg tettgeeggt cacctttggc catctcccgg tgcgtggttc agatgtgggt ' 180 

cctgctttcc cgccccctcc ctcctctgtg cctgcctgcc tctgctgtgc egggecagtg 240 

ccttggtggc cagtggagtg gacaccagct gcgactgcgt gggaggggct ggcattgccg 300 

ctgccactgc agggcttggg eggctgacat gggacgaggc tcgcacagct gccagctcct 360 

gtctcgctga ctttttttat acagttttgt ctgggccacc gecttcagtg ccacgggccc 420 

cttgccgttc aggctgctcc tcatagatga acaaggccct gccccgcgtc cttacccttt 48 0 

agagctgttt aaattcaaat gaactgaaac tgaata-gaa aaatccaggc cctcagccgc 540 

ccaggccatg tttcaagtgc tccatggcca catgtggctg gtggacagtg cagctctaga 600 

acattccatc accacagagg gttctgctgg acagtggcct tgggggctgt tttgagggtc 660 

cgcctgtcag tctcc-ggca tcaaagtcat tetgecattg tcaagttaca gttattttcc 720 

ttttacctcc aagecactat gtgcgtatgc tgctatgtgt ctgtatttcc egctaaaett 780 

cctgtcacgg agggaagggt gccacaggcc cagctcctgg agggggtttg gatgtctggg 840 

tggggggaag ggtgccacag gcccagctcc eggagggggt ttggatgtct gggtgggggc 900 
cagggcgcca cagttccagc teceggaggg ggtttggatg tctgggcggg gctttcttca 
tgttccatgt atgataaegg tgactggggg gtttacagag agaggcatac aaatagtgtt 
ggagtgctgt t^tegctaat ataaatgttt gaatagctaa aaaaaaaaaa aaaaaactcg 



agggggggee egge 



<210> 64 
<211> 1361 
<212> DNA 

<213> Homo sapiens 



<400> 64 



cccgggccga cccacgcgtc cgaatttctt tagtgtattt tatgtacttt* tgtattaatt 
acattgggta gactgttcca taaatgttga atagatccta tttgttggtt aatgctattg 
ttgtgtatcc tgetgacttt ctgtgtagtt ggtctatcag ttgttgggag aagagtgttg 
aagtctacaa ctataattgt gtatLtgtct attactcctt tcagttcttt cagttccatt 
tcacatattt tccagctctt gattggtgca cattaatgat tactatgtct tcttggtgaa 
tagaccattt tataatttta tgatgtacct ctcttgtcac tttctttget ctgaagtcta 
cctgatatta atatagtcac tctagctttc tttggagtaa tgttagtgtg atatatcttt 
tttegteett ttacttttaa ccctatctat atcatta-at tattttaagc gagttttttg 
tagatagcac ttacttgtgt: ctgtcatgtt tttaaatcca ctgtacaagt acaaatctgt 
ctttattttt catttttatc cttattttga gacagagtct cactttgaca ctcagcctag 
aatgcagtgg tgtgatctcg gctcactgca acctctgcct ccgagggtca agcaatcctt 
tcacctcagc ctcccaagta gctgggacta caggcatgaa ccaccatgcc tggctaattt 
tttttttttt tttttttgta gagatgggga tttaccatgt tgetgagget ggccctgaac 
ccctgggctc aaacgacctg cccagctcag cctcccagag aaatatgatt ttaactagca 
catctagatc atttactttc aatggaatta ttgatgeatt acaattcaag tctgtcattt 



960 
1020 
1080 
1094 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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960 
1020 
1080 



tatttattgc cttccatttt tccctccgtt tttcatttct gtgctttttt tctttgcatg 
taaacatgtt ttaggattct gttttgatgt atttagagta ttttttagtg tatcggttta 
tgtagctttt ctagtggatc ctttaagtat tacattatat atacataact aagaagtata j. Ub0 
tcagtgtcat cattttacca gttcaagtaa agtacagaaa tctttgcccc ttttacctgt 1140 
cctgcttata atgtaaatgt cttaaagatt tccttgacac acatttagaa ccatatcaca 
gtgttgtttt tgcttcagtx gtgaaatata atttagaaac tcatgagaat aaaaacccat 
tatatttatt tataaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agggcggccg c 



1200 
1260 
1320 
1361 



<210> 65 

<211> 947 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (67) ; 
<223> n equals a,t,g, or c 



<210> 66 
<211> 1376 
<212> DMA 

< 213> Homo sapiens 
<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, 



60 
120 
180 



<400> 65 

ctttctatag ttaaagccgg tacgcctgca ggtaccggtc cggaattccc gggtcgaccc 
acgcgtncgg tttctgggtt ctaggccctg cttccttgcc cctttgctgc agaagggcag 
ctgaaggctc accctagaaa ccgggcctgg tgggtcttac ccggctcact ccctccrcttg 180 
nccttacaca tacaggaaga caagacctga gtggtgctgt ctttgtgtcc gtcgtgtatg 240 
gctctccctg tctzcatttc ttctcactct gtctctaaac ctctctctct ctcccttccc 300 
cctcagtact nartctacag acctatgtgc gtgtycctat ccttctgtcc ttttctctct 360 
tcagctctcc ctgcctctca cacacaattz tacatgcccc gaggagccaa gtztgggaca 420 
tttaccctcc aggcatccgt gtcccctct- gaagagaaaa cacacagctt cacacatcca 
ggcacagggg gcaagctctt ggggcatcag gaccctggag caccaggtcc ttcctggaat 
attagatcca cctggagcac caggtctctc taagtctcac ctggggaatt cggtcccacc 
tgggtcacca gttcccacct agagcactgt gtcetgccct agagcacaaa gacctgctcc 
tcccgagact ctctctgact gcagccaggc atagtaccct tgcctgtgtt tgctccctgg 
tccacagatt tggtggctgg gcaggtgcct ggacagtgat gaggtcttgc cgccttaact 
gtccccccca gtcacttctc ccacaggccc agcaggacgc agtcctgagg atcagggatt 
ctacagctgc attaaaatca atcctatcca aaaaaaaaaa aaaaagggcg gccgctctag 
aggatccaag cctacgtacg cgtgcatgcg acatcaagct ctgaaga 



480 
540 
600 
660 
720 
780 
840 
900 
947 



<400> 66 

ggcgggcggc gcggaagngg oggckgcgcg gccggggcag ccatgtcgcc attgtctgcg 
Scgcgggcgg ccctgcgggt ctacgcggta ggcgccgcgg tgatcctggc gcagctgctg 

cggcgctgcc gcgggggctt cctggagcca gttytccccc cacgacctga ccgtgtcgct 180 

atagtgacgg gagggacaga tggcattggc tattctacag cgaagcatct ggcgagactt 240 

ggcatgcatg ttatcatagc tggaaacaat gacagcaaag ccaaacaagt tgtaagcaaa 300 

ataaaagaag aaarrcttgaa cgacaaagtg gaatttttat actgtgactt ggcttccatg 360 

acttccatcc ggcagtttgt gcagaagttc aagatgaaga agattcctct ccatgtcctg 420 

atcaacaatg ctggggtgat gatggtccct cagaggaaaa ccagagatgg attcgaagaa 480 

catttcggcc cgaactacct agggcacttc ctgctgacca accttctctt ggatacgctg 540 



60 
120 
180 
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aaagagtctg ggtcccctgg ccacagtgcg agggtggtca ccgtctcccc tgccacccat 
tacgccgctg agctgaacat ggatgacctt cagagcagtg cctgctactc acccc^cgca 
gcctacgccc agagcaagct ggcccttgtc ctgttcacct accacctcca gcggccgctg , 2 Q 
gcggctgagg gaagccacgt gaccgccaac gLggtggacc ccggggtggt caacacggac 780 
stc-acaagc acgtgttctg ggccacccgt ctggcgaaga agcccctcgg ctggttgctt 840 
ttcaagaccc ccgatgaagg agcgtggact tccatctacg cagcagtcac cccagagctg 
gaaggagttg gtggccgtta cctatacaac gagaaagaga ccaagtccct ccacgtcacc 
tacaaccaga aactgcagca gcagctgtgg tetaagagtt gtgagatgac tggggtcctt 
gatgtgaccc tgtgatatcc tgtctcagga tagctgctgc cccaagaaac acattgcacc hhjo 
tgccaatagc ttgtgggtct gtigaagactg cggtgtttga gtxtctcaca cccacctscc 1140 
cacagggctc tgtcctctag ttttgagaca gctgcctcaa cctctgcaga acttcaagaa 1200 
gccaaataaa cattttggag gataatcacc ccaagtggtc ttcaaccata aactttgtga 
ttccaaagtg cccagttgtc acaggtgcca caaataatta cattttccaa cataaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggccgc 



600 
660 
720 



900 
960 
1020 
1080 



1260 
1320 
1376 



<210> 67 

<211> 2434 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (10) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (27) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (73) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (75) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (103) 

<223> n equals a,t,g r or c 
<220> 

<221> SITE 
<222> J130) 

<223> n equals a,t,g, or c 
<400> 67 

ctgggggtan tncaagaacc ctctgtngga ctcagatgtc aagctctttc ctttgggcaa 
cgtgtttcct ctncncgagc agtgtgctgt gtaaactaaa ttngccggtt cgcttcccat 
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ttcctgacan ttgagatgga atgccttgac ca'tggtgct ctgacagaga agtcatggag 180 

tcattgccat ttcctggttg cccttttgca atgtgaccct gttagtagag gttttctagc 240 

ttctactaag atatttcttt ccctaaccat catacacttg gcatgtttca ctcccatctc 300 

ctttcccctc accttaaagg agactacccc tctgccccat atcgtcaacc taattttctc 360 

tcgtactctc tctagtgaat gatgtgctac caagcatatg ccaggctgtg acaggattat 420 

actgagtagt agaaagaagc taatttgaaa taaaaattat ttgtataatt aagaaagcag 480 

attagatgca catggtcaac aggaagttga ctgtatgtct gctagttaga ttcaaaacat 540 

cataaagacg atagcatgtc aatatattag cctagccatt atgttagcct ttgttaggtg 600 

ggcagctttt ctgctttttc ccttcctctg tggtgacaac ggaggaaara tccaacagaa 660 

atacgtctaa cagggaaatt gggatcatag tttatatgca tctgacttga aaggagtatt 720 

gaggaaggtt ttcatacatg acctatcttt ggattaaaaa gaacatttat gaaatcaagc 780 

cttctaacac tagttataat tgagaagcaa cagtaactcc gtggacagca atcaagctta B40 

aaattgtaaa taaatatggg gataattcag ttgttgcaaa aaaagggcag aattcagtag 900 

aataaagtcc ttttctctta caggtattaa atgaggacag agaacctcag gtgttcttat 960 

gctagtgctt gctgagtgca LacUagaaa gcaattccaa acagatgcac acatctagag 10 20 

agagtggcat tagagartca gtgtatgtat ttatctacat gagaggaaac tggaatataa 1080 

tcccacaaat tattggaata taatcccata aattatcacc ttttatgact ggaaaatatt 1140 

tgccaatgaa gaaatggtct 'gtaggtattt gtcttaagat ctttggctgt ttaataaaaa 1200 

tgtaacttta acggtttctt atagttgcct t-ataaagtg tattgtctaa aatatttLLg . 1260 

tatcatgtgc ctttgaaatt tgacagctga tctgggtgtt ggatxtccgc ccagccatct . 1320 

accagtatta tcattttatt cagtagctgg caggtgtatt agacaaacga gacttaggta 1380 

^?!__?? aa ^tttcctgt ggtttgactg cacatcacac cagaagactc cagtatccct 1440 

1500 

tggaatatgt gaatttctac ctctagaagt tgtgaaatag gccctgcact tttgcagaat 
gtccttcttt aaacccggcr tattccacag ctgtagctga taacacgacc tggggcttag 
ctgctctagc cctgggttct tcgagacctc acactgcctg gcccctggcc atccacctiaa 
ggactgcctg ctttctggtc acacgtggac cttgataega ctaagcggtt acatatgtgg 
ttgtgcaaaa gcrttctgtt taatgcatag tgttaccgat ttacatcttg gttttcagtg 
gcactatgtc taggaggcaa tatcctttta aacagtgctr cggctaagac agatacttgt 
gaatcaaaga tagcacagaa atgaactaag tatatccca- ttggaattat attttgatac 
tatttaaaat ggtttcacct gttaaagggc caacagaact cttggtttta cttttgtaat 

tactgtacag aaaatttcaa gagtgtttga gtgcttgtca tcaggtgttt tccttaataa « iWV , 

gcagggatat gatcattcac aggaattata tatgaaaaaa gtttttgaaa tgtatttttg 2160 

" 2220 

2280 



cattccagaa tgaggaaaaa gtattctaca aagaacctaa tcacctctgt gaaatctatg iJW u 
ggatggaaac agtgtggcct taggagtcaa atagtctctg catggtgggg aggatcatga 1560 
h , _ _ ^ 1620 

1S80 
1740 
IB 00 
1860 
1920 
1980 
2040 
2100 

j H — 3— — vvjvuULVLLy 

tgatgtgcca tgttgagggg aaaccaaata cztacgattt caaaacattc gtacgaaaac 

attgtacaat gtaatatgct caactttctc aattttttgc taatttttct aagatacatt ^ ov 

aaaaatgttt tatatttttt tttaagtaaa acggacccag taagaaaatt aaaaatacca 2340 

gaacataaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2400 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2434 



<210> 68 
<211> 1086 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITR 
<222> (10) 

<223> n equals a,£,g, or c 
<220> 

<221> SITE 
<222> (77) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1056) 
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300 
360 
420 
480 
540 



<223> n equals a.t.g, or c 
<400> 68 

ttgaaacacn cttttgagca ttaggtccca gctccaccgc ggtggcggcc gctctagaac 60 
tagtggatcc cccgggntgc akkaattcgg cacgagcaac ctgtagtgag ctggctgtca 120 
ctcagcagct ctggaaaatt acctgtctgc actgaatttc ctcatcagta aaatggaaat 180 
gattatagta ctgaccttgt aggatcattc taaaaatcag agaagttcat gcagcttaga 240 
acagtgccag gcacatgg^a aatgctatgg catacttaag catcttcctc tgtggtgcct 
cgtcatcacc atgtgattgt gccttgcfctg tccctgtttc acttttcaga gggagaaaag 
tggcceactt taagaatcaa aattctgatg ttacttcggg aaatgcatag agccagagag 
acacaatttg acttagtatg atccacatca tcccctcagg ctgaatagtg gtggcatgca 
catctatacc aaaatgtttt accttttttg tagaaggaaa atatttgtat cttctattcc 540 
atatcttaga tctttataag agcacctaag ttcaacctcc taagaaactg ccaattttgt 600 
tgatcatgat agtctgcaca gattttcgta ctatttagtg ktgggagtgc cttagggacc 660 
atcaacaaca ggscccLcct tttatccatg agactactga ggccttggag gttatttgtc 720 
catccatggc gggtgcacrg ctagaggtta tctggttagt agcccaacta agattagaac 780 
ccaggaattt tgatttaatc tcaaatgacc tcttttattt ctgcatccgg gaaggagaag 840 
aggaaagtaa acgggaaatt cafcgttatct atggaaaagt tatcagtttg atgtutatta 900 
aatgatttgt ttcaggagat tgtttacaac ttttatcttc ctagacaata attttctgta 9*0 
agagtaaagc catggtcatt aaggtagtca tattaatgca ttcagtaasc tgtgaagaaa i020 
aacatatatc aatgttttcc aacaaaatac agtgantacc rgaaaaaaaa aaaaaaaaaa 1080 
aaaaaa 

1086 

<210> 69 
<211> 1262 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (568) 

<223> n equals a,c,g, or c 
<220> 

<221> SITE 
<222> (639) 

<223> n equals a,t,g, or c 
<400> 69 

cctyctatag gkmaagctgg tackcctgca ggtaccggtc cggaattccc gggtcgaccc 60 
acgcgtccgc ttacaacttc tctyttgttt ctgtgtttct tccacaccat tatggcgtat 120 
ttactgatct acaaccggtc ttaagttagt tgcctgcrta aaattttctc atgaattctc 
attacatttt caaacaaaat ccaaagtttt taccatgacc ttgtggauat ctttctgacc 
tcatttacta ctgttacctt ccttattcat tqca^tccac ccacagtggc cttcttgcaa 
gtcagagaat gcaacaaata tgctcctgcc ttggggcatt tgcacttctg ttcttttggc 
ctggtcactt cacctccaca ttctccattt tttatgactt tcttcctata tttggatctc 
tgttcaaatg tcacccctca aaaaggcctt caaaactacc ctatttaaaa tagcacctgc 
cactgccatc tccataccca ttctcatttt tccatagcac ttatccctac ttggcactat 
gttatatatt tgtctattcg kttactgnct gtctccccca ctggactgta agctccatga 
ctgtaagtac atagatgaat tacaaaatga atcaatggng aawtatcctt gmcatatatt 660 
atttaatgat tttgmctcca ntntaagtaa aaaaaaaata ccattttttt cactc-tcaa 720 
agtgatatag ttsaatcttc taaactacat ttttctcatt tcccgattta attaaatcag 780 
tgatataaaa aaaagagtga tggggatatg tgaaagaaga cnaaaataga tgccaggaaa 
tactaaaacc gctgaagttt agtggtatat tttttcttta cacacagcat ta-ttgagtt 
actaattgtg tcactgaatt acagaataag caaaatacta ggtaaacaga atcacgcttg 
ggggctatat tttgtgtaaa atttgtgtta tgcaaaaata atattaaata tttaattact 
acagttttgt tatttctttc -tattttagg aaatgatttg cagctgagtg aatcaggaag 1080 
tgacagtgac gactgaagaa aLatytagct ataaataaaa acttatacag cacgtataat 1140 



180 
240 
300 
360 
420 
480 
540 
600 



840 
900 
960 
1020 
1080 
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ttattttgta ttaacaataa aaattcctaa gactgaggga aatatgtctt aacttttgat 
gataaaagaa attaaatctg actcagaaat ttcaaaaaaa aaaaaaaaaa aagggoggcc 1260 
gc 



1200 
1260 
1262 



<210> 70 

<211> 1642 

<212> DMA 

<213> Homo sapiens 

<400> 70 

ggcgcgctgc ccggarctgc ctgggttgcg ccgccggcca cgtccccgcg ccgggcctca 60 

ggctccttcc tactgtccga gggccaccag gccgccgggg gcctgctgcg cccggatgcg 120 

tctgttacta gagtggagag tctaccttcg tctcacatgt gccacaaagg atggcatggc 180 

ccgggagtgc cccaccacgL ggctzccacc ccctgcaaag ccagacttcg cccagcgaca 240 

cagtgtcaag cccacagctc tccaaggagg aagatggrcc aggctgggag catcycctta 300 

gcagcagcct: ctgatccctt gaccaagcag gagggaacca ttagcagcct gaggagctgg 360 

CtaactCaaa Crectcacrerna nrnrrrart/i^ *- — ~ , ■ . 420 

480 
540 
600 
660 



ctggctggga gcctcgggga ccgcccagcc ttgctcccag ctcacccaca agatgtggac 

agctcttgtg ctcatttgga ttttytcctt gtccttatct gaaagccatg cggcaLccaa 

cgatccacgc aactttgtcc ctaacaaaac gtggaaggga ctagtcaaga ggaacgcatc 

cgcggaaaca gttgataata aaacgtctga ggatgtaacc atggcagcag cttctcctgt 

cacattgacc aaagggattc ggcagcccam ctcaactcta tggaagtcac aacagaggac 

^ g !^? t ? S ? t ^ aacca ^ ca actzcaggag gtgcagctga tggtgtgacc 720 
_ ~- 780 

840 
900 
960 
1020 
1080 



tccattgctc ccacggctgt ggcctccagt acgactgcgg cctccattac gactgcggcc 

tccagtatga ctgtggcccc cagtgctccc acgactgcag cctccagtac aactgtggcc 

cccatcgctc ccacgactac agcctccagt atgactgcgg cctccagcac tcccatgaca 

cttgcaetcc ccgcgcccac gtccactcyc acagggcgga ccccgtccac taccgccact 

gggcatccat ctctcagcac agccctcgca caagtgccaa agagcagcgc gttgccaaga 

acagcaaccc tggccacatt ggccacacgt gctcagactg tagcgaccac agcaaacaca 

-2™™°! tga ?^^?? tccaagtcct tccaagcaca tgcccagcga caccgcggca 1140 
" 1 - 1200 

1260 



- — J — i» — — a i.ui.v.yv^ijyi.Q 

agccctgtac cccctatgck tccccaagca caaggtccca ttagccaggt gtcagtggac 
cagcctgtgg ttaacacaac awataaatcc acamccatgc cctcaaacac aaccmcijgag 

cccctcaccc aggccgtggt agacaaaact ctccttctgg tggtgctgtt actcggggtg 1320 

acccttttca tcacagtctt ggttttgttt gccctgcagg cctatgagag ctacaagaag 1380 

aaggactaca cccaggtgga ctactcaatc aacgggatgt acgcggactc agaaatgtga 1440 

ggggggcggg ggcctggcgg gaggcctggc cccttcctcg tcctctcctt ttgcctttga 500 

gaccaaacca agtgcttcca aattcttttg gtgcaattga ggagatatgc cagatgctta 1560 

aacacatzta attigctgtca gatraattcc atgatcacta aagagctgct gcttttttca 1620 

taaaaaaaaa aaaaaaaaag gg 1642 



<210> 71 

<211> 921 

<212> DHA 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (4) 

<223;> n equals a,t,g, or c 

<220> 
<221> site 
<222> (9) 

<223> n equals a,t,g, or c 

<220> 
<221> SITE 
<222> (11) 
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<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (15) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (20) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (901) 

<223> n equals a,t,g, or c 
<400> 71 $ 

gcgnggggna r.aggnaagcn ccccactatt gggttcaaaa gctggagctc eaccgcggtg 60 
gcggccgccc tagaactagt ggatcccccg ggctgcagga aLtcggcacg aggtctgagc ' 120 
agataagatt aagggccggg tctgtgctca attaactcct gtgggcacgg gggctgggaa ' 180 
gagcaaagtc agcggtgcct acagtcagca ccatgctggg cctgccgtgg aagggaggtc 240 
tgtcctgggc gctgctgctg cttctcttag gctcccagat cctgctgatc tatacctggc 300 
atttccacga gcaaagggac tgtgatgaac acaatgtcat ggctcgttac ctccctgcca 360 
cagtggagtt tgctgtccac acattcaacc aacagagcaa ggactactat gcctacagac 420 
tggggcacat cttgaatccc tggaaggagc aggtggagtc caagactgta ttctcaatgg 480 
agctactgct ggggagaact aggtgtggga aatttgaaga cgacattgac aactgccact 540 
tccaagaaag cacagagctg aacaatactt tcacctgctt cttcaccatc agcaccaggc 
cctggatgac tcagttcagc ctcctgaaca agacctgctt ggagggattc cactgagtga 
aacccactca caggcntgzc cntgtgctgc tcccacattc cgtggacatc agcactactc 
tyctgaggac tcttcagtgg ctgagcagct ttggacttgt ttgttatcct attttgcatg 
tgtttgagat ctcagatcag tgttttagaa aatceacaca tcttgagcct aatcatgtag 
tgtagatcac taaacatcag cattttaaga aaaaaaaaaa aaaaaaarct cgaggggggg 
nccggtaccc agggcggaag a 



600 
660 
720 
780 
840 
900 
921 



<210> 72 
<211> 906 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (34) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (833) 

<223> n equals a,t,g, or c 



<400> 72 

ggaaattctc cctcactaat tggaacaaaa gctngagctc eaccgcggtg gcggccgctc 60 

tagaactagt ggatcccccg ggctgcagga atteggcacg agggaagaga gaggggaggg 120 

tgagcagagg acaggceggg agttttccgg gaacggagga agagcagtgg aggctgecag 180 

gatgaggctg ctgtgtggcc tgtggctgtg gctctccttg ctgaaagtcc tgcaggccca 240 

gaccccaacc cccctgccac tcccgccccc gatgeagage ttccaaggaa accagttcca 300 

gggggaatgg ttcgtcctgg gcctggcggg caacagcttc aggceggage acagggeget 360 

getgaacget ttcaccgcaa ctttcgagct aagtgatgat ggccgctttg aggcgtggaa 420 
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tgcgatgact cgaggccagc actgtgacac atggtcttat gtgctgatac cggcagccca 
gcctgggcag ttcactgtgg accacggtgt gggcaggacc tggttgctgc ctcccgggac 
gctggaccag ttcatctgcc tgggcagagc tcarggcctc tcggatgaca acactgtctt 
cccagatgtg actggargtg ccctggacct carcagcctg ccctgggtgg cagccccagc 
ctgaccactc agacagccgc ggcccccaag gcctgactct tcttgtggga gggcgaggct 
ggtcacccca ggccagcgtc tgtcgaagga tgaagcagcc cctgtccggc ccagccctgc 780 
ctcacagctg tgcgagctct gccctcctca gctctcaaac ctgaataaat gcnccaagcc 840 
cagaaaaaaa aaaaaaaaaa aaaaaaaaaa ctcgaggggg ggcccggtac ccaattcgga 900 
agattg 



<210> 73 
<211> 6B0 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (7) 

<223> n equals a,t,g, 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, 
<220> 

<221> SITE 
<222> (15) 

<223> n equals a,t,g, 
<220> 

<221> SITE 
<222> (16 ) 

<223> n equals a.t.g, 
<220> 

<221> SITE 
<222> (22) 

<223> n equals a,t,g. 



480 
540 
600 
660 
720 
780 



906 



<400> 73 

cactcantng aacannagct cnagctccac cgcggtggcg gccgctctag aac-agtgga 60 

tcccccgggc tgcaggaatt cggcacgaga tatttcgctg gaccctagaa aagccaccac 120 

gacctgtggg ccatgatgct accccaatgg ctgctgctgc tgttccttct cttcttcttt 180 

ctctccctcc tcaccagggg ctcactttct ccaacaaaat acaacctttt ggagctcaag 240 

gagtcttgca tccggaacca ggactgcgag actggctgct gccaacg^gc tccagacaat 300 

tgcgagtcgc actgcgcgga gaaggggtcc gagggcagtc tgtgtcaaac gcaggtgttc 360 

tttggccagt atagagcgtg , tccctgcctg cggaacctga cttgtatata tccaaagaat 420 

gagaaatggc ttagcatcgc ctatggccgt tgtcagaaaa ttggaaggca gaagttggct 480 

aagaaaatgt tcttctagtg ctccctcctt cttgctgsct ccrcctycty cacctgctct 540 

cctccctacc cagagcLctg tgktcaccct gttccccaga gcctccacca tgagtggagg 600 

gaagtgggga gtgattgaaa taaagagctt tttcaatgaa aaaaaaaaaa aaaaaaaaaa 660 

aaactcgagg gggggcccgg 6 



<210> 74 
<211> 1633 
<212> 3NA 
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<213> Homo sapiens. 



<400> 74 
ggcacgagca 
ggagaattga 
cagggagaag 
catccrgggg 
gggctcaggg 
cagtcctcct. 
cctgcgtctg 
ttgccctttc 
ggatctgctg 
ggaactgccc 
taatctgcca 
tccacaccct 
- ctgaggccgt 
gctcccagcc 
gggtagcctg 
gaccfcctagg 
gctaatgttt 
Cttctgcacaa 
ctaaggctaa 
aggattcaga 
cctcctccgt 
ccctggtctg 
ccaagctttg 
accttcagga 
acccttcttc 
cagtggtaac 
tattaattga 
ccgtcgacct 



tcagtaagta 
-azagtagct 
gagggaggag 
atagcacagg 
gtgggggagg 

gctcccgctg 
tggccggttc 
tggtcctccc 
acagactttc 
acggctgtgg 
ctcccctctg 
tgtacactgc 
cctgccagca 
tgcctcatct 
gatgggccca 
gaagggcagg 
accgagaact 
ctaccccacc 
ccaatagaaa 
accaggtgct 
gacctctggg 
agatccactg 
gatactttct 
gcggcctact 
catcgtgtcc 
at tgttaaac 
tggatctacc 
cga 



ggagctaggg 
ggacag&act 
ctacccccac 
ctcaggcctg 
ggctgcacca 
ctgcgtgccr 
ctgccaaccr 
cccccctccc 
ttcttgctgc 
tccacctgct 
cagccgccaa 
attctgcctg 
cctaccgcag 
gctgtactta 
5 tggcacactc 
gtctggagcc 
taggatgtga 
aggtaagtac 
tgaacctgcc: 
ttacttcagg 
gaaggcccag 
actcacaggg 
tctggattcg 
agagccacac 
tc-cccttag 
aaaacaggaa 
ttcctagctc 



aagagaaagc 

ctagatgtgt 

gatgacctcc 

ccctggttgc 

cattaggacc 

gggctctggt 

gtctagtcct 

cgctccccat 

ctgcctgctt 

ggragcaacg 

ctggtgctga 

Lcactctgga 

ccccatgggt 

cagagccaca 

actcccagag 

caataataaa 

taggctcaat 

tattaatatc 

taatgtccca 

gctgaagctc 

caacaccctc 

gtagggttaa 

gagaaggctg 

atttcccagc 

gctcctggaa 

aatgggactt 

gtgccgaatt 



atgcaaaagg 

gtgtgtgaga 

aacttccctt 

cctggcgatt 

tgccgtactc 

catgccaggc 

ttcaggcttg 

ctagcctttt 

acatttcaga 

ccctgttacc 

gctgccacct 

cagctgccca 

ggtcctgtcc 

tctggttgaa 

aggcgggcat 

acacagtaat 

gacttaccag 

ccctttccca 

tgggcataat 

ataaccacta 

tattgcgtcc 

ggttaaggcc 

gaataactga 

tctccttgtg 

agttttcaga 

gtgtgtatat 

cgatatcaag 



gcaagaatca 
gaaagagagg 
tctgcaccct 
ggcctggcct 
caatcccatg 
ttcctzgtgt 
aggccctgca 
ttgggttccg 
agacccctct 
aaatgctaga 
gttgtctgtt 
gcagctgcag 
agttcctggc 
tacagagctg 
cctctctctg 
gataatcata 
aattactcta 
cacgaagaaa 
tcatggagcc 
gaccaaagcg 
aatgacttcc 
agagtctcag 
atttctgctc 
tgf.ttcccag 
gagaatcacc 
atgattaaac 
cttatcgata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
- 960 
.1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1633 



<210> 75 
<211> 1022 
<212> DNA 

<213> Homo sapiens 
<400> 75 

ttcccgggtc gacccacgcg tccgcccacg cgtccggctt ggggccagca ccctgtctca 60 

aagatggcaa aatgaggcta gttctggatg agctagctgg tgtgggttcc aaccatagga 120 

acacactgat gcncaaatcc taaggtgcca agctctaggc cctggaggct ggtagaacag 180 

gatctatgcc tggaatcctg gcagggattc ctgtcaagga cttgtgttta agcctgcttc 240 

agggcttcag gctgcttctg ctctgtgtct gcccaggctg gctgagcggg tggatgggtg 300 

gacagaaggg ctcaccaagg attgtggaca tagggtaggc cctggtacca cgggtttcag 360 

gctgctatca cttcccttgt aggaacatag ccagaagcag atgagccagg gtagagggct 420 

ggcccctcct ctcatcrtcc cttcagtctt aaattgtctc cagcgatggg aagaggccag 480 

ggactgtaac ccttgtgcr.g tgtattctct gagcctctgc tcactctcag ggccaagcaa 540 

ctcccaagcc ggggccctc: cttggccaaa atctgaggag cagtctaggt tacaggcttt 600 

tcggtaggta ggttctggct gcctgttaat gcagttaggc cccctgatta ggcacagtga 660 

gaaacaagcc agaacaaccc tggcccagaa gaccgtgcac tccagcaaga tccagggatg 720 

atagccttgc agggccac^g ggagtttgtg cccaagcttc tccctcttct ctccccaggg 780 

ggcactggga ctggtccccg ccctcatcct tagcctgggc cttccccaga cgtatraaag 840 

agaagcatga ttcctctgtc ttcagtzctt ttcaggggca tcctgcccat agtacccagt 900 

tcccaagggg cccccagcca cgtggtgaag cctagcactc atgcagctzct tagggaacca 960 

aaaaccagca ctgaaataaa gctgaatgac tgactgadaa aaaaaaaaaa aaagggcggc 1020 

Cg 1022 
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<210> 76 
<211> 1184 
<212> DNA 

<213> Homo sapiens 



<400> 76 

agcaaaaggt 

accaatttcc 

tggagctctt 

gaattccact 

ggtttgtttc 

tgcatacagg 

aagtatattg 

catcaggcat 

tctcaggctc 

acagccagct 

atttctatcc 

acaattcccc 

tgcctggcac 

aaataatgag 

tgacagcctc 

cccctgggga 

ccgtctctac 

gctactcagg 

gccaagattg 

aaaaaaadaa 



gaagagagaa 
ttcggtttaa 
tccataacaa 
gtgtggatgg 
caatcttttg 
tttatatgta 
tggacagaca 
gtgtgactac 
aa tgcagaac 
ctgccacatg 
tgcacagata 
agaaacacat 
ctcaaagcca 
aaataagaaa 

aggtggcaag 

agatcacgag 
tgaaaataca 
aggctgaggc 
cgccactgca 
aaaaaaaaaa 



aggaagcttt 
tgtacctgca 
cacttggaaa 
aacatactct 
ctttutcaga 
ggaaaaattc 
atgccaaatt 
caaagctcct 
tttgaggtaa 
aaggagagcr 
tattttccac 
ccccctgctg 
agcaccatgg 
tttcagcttc 
cagctactct 
gtcaggagat 
aaacattagc 
aggagaatgg 
ctccagcctg 
aaaaaaaaaa 



ttctctaaaa 
tctcggggtt 
gcactctcat 
atttcaccag 
gtaataacct 
ctagagtagg 
gcctttcaga 
gtcagctgtt 
gcttfctctaa 
caaacgagac 
attctgggaa 
gtacagccaa 
atgccacttg 
tcagtccctc 
gccggacagg 
caagaccatc 
tgggtgtggt 
cgtgaacctg 
gcaacagagt 
aaaaaaaaaa 



atggtgcagc 
tttttccact 
ccttttttca 
tcccctgtag 
tgcacgccta 
attgctggac 
gactgtggcc 
ttattttatc 
aatgtaggct 
agaaacagcc 
accgtgaagc 
gccccagaac 
ccatgggtgc 
tagccaacat 
gcagaagatg 
ctggctaaca 
ggcaggcacc 
ggaggtggag 
aagactccat 
aaaa 



tatcctctga 
taatttatcc 
ctgctgaaca 
ccagtcactt 
tcattttgta 
caatggataa 
ctgtgcaccc 
tcctttccag 
cctaaacgcc 
tctgggcagg 
ttccagagcc 
aagctgtgct 
ctgcaatttc 
ttcaggtgca 
gaacacccca 
cagcgaaacc 
tgtagtccca 
gttgcagtga 
atcaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
- 780 
< 840 
900 
960 
1020 
1080 
1140 
1184 



<2lO> 77 

<211> 312 

<212> DNA 

<213> Homo sapiens 



<4C:> 77 

aatccccggg 

agggccagcc 

ggtgcacttt 

tctgaaacct 

ctgtctggua 

agggcggccg 



tcgacccacg 
cacccagcat 
gtgtgtgcct 
acagaaccac 
gctctgctcc 
ct 



cgtccgtgat gagtggattt gtactcttac ccaggtcctg 
ccccaccccc gatgacgctig tccctacaac tggctgaact 
tccagagcca gtggactggt gtgtatccaa tgatgccacc 
ta^gctttgc atgtgz;accc tgcagggtct. gagggccagg 
tgggtgacag agcaagactc tgtctcaaaa aaaaaaaaaa 



60 
120 
180 
240 
300 
312 



<210> 78 

<211> 1370 

<212> DNA 

<213> Homo sapiens 



<4Q0> 78 

tggttaaaga 

ttatatgccc 

gcatccacca 

aaaatccttg 

tctaacctgg 

tggactatta 

gcatccacca 

tccatccttc 

atacctacct 

actcccatca 

etgccaaagg 



gtacgacatt 
ttggaagcat 
cgccaatcca 
gggctatcta 
gccctgacct 
gccctaaact 
agtctggctt 
tgtggaacat 
ctcccttcaa 
gtacaccccc 
Ugcctgggct 



ttagtcaggg 
atggggactc 
aggcagctga 
ccaacatgtc 
tagcacagat 
tcttctgtac 
tcacacccga 
cagtgtcact 
agtcccggat 
aagtccttcc 
ctacctgcca 



ctgagcacag 
cattCctgtc 
tggctcagga 
tccattccac 
ctgcctatat 
tgccactgca 
accccacacc 
tagcaatagc 
gcctgataca 
agtgaaagat 
ccagtgatgg 



cgccrggccg 
actgcggcga 
aagtcagaga 
ctgttagggt 
ttttttgaag 
gggctgaagg 
tcgcctccac 
catacaatcc 
ttgcacctgc 
gtaacaattg 
ggtcctcaca 



aggagtgctt 
ttc=acttat 
ctgagatgtt 
taaaggtttt 
ttcagccacu 
agccttgcac 
cct-agctat 
cattatcctt 
tagtgcattc 
aatggccact 
gccaacccag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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tggtgtatca tccatttcca daaccatctc amtgcctgct ttcttggacc actcctggca 
tcaactgttr fcaggttagag tgactgaaaa tttgggktat aagatattta ttagagatca 
atatctatta aaaatgngaa aggaagcagg attktgctga ggaagaagat aacaacaaag 
ataccacaat gtcagcccat caaagccttt ggccaaccca gcasaaaaac ctggagcagg 
tgtayctttt tcagagtctc cccagttagg tcaaaatgtg cgcctttaca cccgcacttc 
r^^" C ^gcctcagg ctgtcctggg catgacctca gatgaagcgg ctcacacagc 1020 
tgaggctaac gttgtcggag ctgacagctg aaagccgttt gccgaccaca ctcccacagc 1080 
cgagcagcat gcccttgctt ggaggaggat ttggatgaca cagctctatg tctgccgtat 1140 
tttgggggtc acattcctca cacccagctc tgcctcaaa; ggcaacattg acaaaaattc 
attccagttg caaaccaact gtagaatact ataaaaccag agtacaagtc taatagcata 
aatctcatcc taagtggaaa agaaagggtc atttattcac acatttttgg ggaaaaaaaa 
acaacctttg ctatgtcttt attacaacac ggatactcac aaaaaatagt 



720 
780 
840 
900 
960 
1020 



1200 
1260 
1320 
1370 



<210> 79 
?<211> 368 
<212> DNA 

<213> Homo sapiens * 
<220> 

<221> SITE 
<222> (5) 

<223> n equals a,t.g. or c 
<220> 

<221> SITE 
<222> (13) 

<223> n equals a.t.g. or c 
<220> 

<221> SITE 
<222> (35) 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 
<222> (351) 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 
<222> (367) 

<223> n equals a.z.g, or c 



<400> 79 

aatgnaactc tcnttactaa ttggaaccaa agctngagct ccaccgcggt ggcggccgct 60 

ctagaactag tggatccccc gggctgcagg aattcggcac gaggtccagg tgtctttatt 120 

tcagatgttc tflttf.i-.t-.rct. ctattttctg ctcacatgaa catacacact tgccaggcac 180 

ttCttatttc tcrttctgat tttgcctcct tcctgcccct gttttcttcc 240 

»^ C ^ 9 gctatagaaa ctgtcaggtg gccttgctgg catcatccta cctcttttga 300 

1 &C ccaggaactt ccatttatta tgcctcgagg gggggcccgg nacccaattc 360 
gga agaric, 



368 



<210> 30 
<211> 1088 
<212> DNA 

<213> Homo sapiens 
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52 



<220> 

<221> SITE 
<222> (1) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 
<222> <4) 

<223> n equals a. e.g. or c 
<220> 

<221> SITE 
<222> (5) 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 

<222> (9) ? 
<223> n equals a,t,g, or c 

<220> 

<221> SITE 
<222> (11) 

<223> n equals a,t,g, or c 
<400> 80 

60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1088 



nttnnaacng naaatctccc cttactattg ggaacaaaag ctggagctcc accgcggtgg 
cggccgctct agaactagtg gatcccccgg getgeaggaa tteggcaega gattccccag 
aattggttca tcttgggaaa ggaaggecta ggaagctgat gacctacttg ttttgttcat 
ccatttctct actgettett aaagtgcact caageggtea ccaggacatc agaaaageca 
aatccaaggt cccaaggctc ttgatcatcc agtgcccaca acagagggag taagctagac 
ggagaagctg aagtgaagga ggaaagagag aagggggtgc ctgttccacc cccttcctag 
cagctgcctc tgtgggctcc ataatcccca tgccctctcc cccaccccac cctccggccg 
mccacctccc ctccacccac cccattcatc agtaggaggg cactgtatca gcacagcctt 
teaaggectg acagttcctt tctgggtgcc ccccacatgt aaccacttaa cccccagcac 
aaagggccag cctgaatggg cctggagccg gtggctcgtg cgraggggga ggectcaata 
ggatttgggg agctgggtgt ataaaccget ctgctccagc agccctcaat ggaggagctg 
gagttggggc atgtgccgag gggaggagct cggcagctgc atcccgsccc tgaacaatga 
t.tattcatt ctcctggcac gcacagagaa gacaggctgg ggaaacatgg ccccacaaat 
cgccctcagc ctcagctgat tcccacggca ateattgeag ttcagagtgc cccctcccac 
™£! a 9 ^ " gcctcaa ^ cttgcccttc ctcttcctgg taaggaagta gactacatta 
egggaggect taagtagect taaattcagg caagcatcgt tctttgagtt ggttgcaccc 
aaactcaatc tcactctctc aaagctttct ggccaggttt tcctcacaca cccatagaaa 
aagat?r ga ° aaaaaaaaa aaaaaaaaaa actcgagggg gggcccggaa cccaattcgg 



<210> Bl 
<211> 1862 
<212> DNA 

<213> Homo sapiens 
<400> 81 



ggcacgagct ctgcaaagct ccctctgctc cagcacgtcc tgtatttttt caacttrgga 60 
tgtttgcaca agctatcatc ttttgeaaga atgtcctgcc ccctcctatc catctgtgga 120 
agtcaacctt tctttagggc ccagaggaaa acctatcacc ttctacacct tcccagatgc 
ccaccattcg ggtcaaactc tgccccctcc aggtttgcca tggtcttttc cctctcttaa 
agcactggtc ccagccratg ccattgtgtg tcaccttggc tcctgtctcc tactggccgt 



a-^t-^f- - 3 * ""^^yy- ^ttgucccc caccggccgt 

a y atcttugg gargggagga aceggggaat tttcatcctg gggctacggg acagtgetta 



180 
240 
300 
360 



WO 00/06698 



53 



PCIYUS99/17130 



cccctaagta 

tccaacagcc 

cagtczcatg 

cctgctgggc 

gggccctccc 

gggtgatgag 

ccctcaagta 

ttctgcnccr 

tcctcccttt 

caaggtgttc 

ttcaacggcc 

tcttcacctg 

ctgagacaat 

aagttgttaa 

atttttctct 

ggtgac tgsg 

aggagttgtg 

cccaacaatt 

ggctgctttg 

ctttaattga 

a-atttggaa 

ccacgtgatg 

atgtaaagac 

taaaagaagt 

ctagcctggg 

ag 



caaaataagc 
cccaaagatg 
gatttagcag 
cagctgtggt 
tcctggacct 
catccggaag 
ccctactcac 
catcctaccc 
tctctaccat 
tttccatacc 
acccactgcc 
gcccttttga 
cttaactttt 
atggtgatta 
accLLccctg 
caaccaggtt 
taaatgaaat 
gcagctttgc 
ccmtggaagt 
ctttgtcatc 
gcaggttcaa 
cagcttgttc 
tcaaaaaggg 
gttgtccagg 
ccatagaatg 



aagaatcctt 
gggtagzcca 
ccttcgtagc 
ctctgagaat; 
gggtccatg- 
tcagcttgct 
atggtccaag 
acccagctgt 
ccaccccttt 
aaaaaatctg 
ctcaggataa 
gataagagga 
gcagctgtcc 
atttatctgg 
ccgcacattg 
cactaaggca 
|sgccgataat 
tgcaacatgg 
ggggttacag 
ttastgagaa 
tacacaacac 
atttc^gaat 
gatiggcatgt 
aggtggaggt 
agactccttc 



gaattgaaaa 

ctgctcccta 

cccaggacct 

gctcagttcc 

gaccaagtgg 

tgtctgccct 

agggaaccac 

tctgggcaag 

gatcacctaa 

atcaggccat 

aaactaaaat 

tttataatct 

aagctattta 

cc cgaatggc 

ggacacagca 

gtcagcgcag 

accaaaggaa 

cagggtggga 

ggcagggaga 

cttacaaatt 

agtccnccct 

tccttttaat 

aaactatagt 

tgcagtgagc 

tcaaaaaaaa 



gctccagtga 
tataggaagg 
ttcccaacgt 
ctgggctagg 
ccagccagcg 
aatcacagcc 
tcttctgtca 
atcctgtggg 
tcctrcarct 
cactcctctg 
tctcaacagg 
tttcaaagcc 
agattgatga 
akgtgaattc 
aggccaggac 
agaagaagca 
ccccaagaga 

ca ggggtagg 
ggcttaccac 
gagctattct 
ctagaatttc 
gacaaaaaca 
tggaacaata 
tgagattgca 
aaaagaaaaa 



cttgatagct 
aagaggttca 
ccccagttct 
ctggcLtacg 
agagagccct 
ctgggctcta 
ttccctcccc 
catgatggaa 
gccctccaac 
tttaaaacca 
atttacaaat 
aacaagcatc 
ggtctgcamt 
caaggggatc 
acccttytta 
gaaaatgggg 
ggttcagtct 
tttggatctg 
cactgagcat 
tcattgagct 
tgctttatgt 
tttggaagga 
tgtggccatc 
ccactgcact 
aaaaaactcg 



420 
480 
540 
60O 
€60 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
•1500 
• 1560 
1620 
1680 
1740 
1800 
1860 
1862 



<210> 82 

<211> 1618 

<212> DNA 

<213> Homo sapiens 



<400> 82 

ccacgcgtcc 

gaggccacag 

ctcctgctct 

gtggccagcg 

Ccctcagrcct 

atcaccatcc 

ggcagggagg 

atgggcagcc 

acggccctgc 

cttgatgggt 

ttct tcagcc 

aaccagaaag 

cagagtgtga 

cagcgcgaga 

ctcatcaagg 

ctgatacggc 

cccagggctc 

tcccagggcc 

catgcccctc 

cgggtgggca 

ctgcaggaga 

agagcaaagg 

tcccctagcc 

ccattggtgc 

acagccgagg 



gcaaggagcc 

gacacagggt 

ttctcctgat 

actgccaacg 

cttcctccgg 

tgaagggcga 

gtccccaagg 

ccggcgcccc 

acagcggcga 

gctttgacat 

tcaatgcgca 

aggctg-cat 

tgctggacct 

acgccatcta 

ccgaggacga 

atcctgcgag 

tgccagggcc 

tccagctgcc 

tcaccggccc 

gctcctcgtc 

agggccacgg 

agggacctca 

ttccaaaccc 

tcatgcagac 

gagccg Lggc 



agaggccatg 
caeca tgggg 
gtgtgagatc 
gtgctgtgac 
ccgcccccac 
caaaggggac 
ggagcctggc 
gtgccagaag 
ggacttccag 
ggcgaccggc 
cagctggaat 
cctgtacgcg 
ggcctacggg 
cagcaacgac 
ctgagggect 
aagacctgcc 
ttgctcaqtc 
ctcagacact 
agtgccccga 
tcagagccct 
aagccccagg 
ggccttccgt 
aggtggcctg 
tctggggctg 
tccatggcca 



cagtggctca 
acagccgccc 
cc tatggtgg 
tctgaggacc 
gccctgcccg 
ccaggcccaa 
cctcagggca 
cgcttcttcg 
acgctgctct 
cagtttgctg 
tacaaggaga 
cagcccagcg 
gaccgcgtct 
ttcgacacct 
ccgggccacc 
ctcctcactg 
ccttccacca 
gatgtctgtc 
ctctccaggc 
cctccggccc 
etttagagee 
ttcttcttcc 
cccttctccc 
aggtgccccg 
gaegaeggaa 



gggtccgtga 
tgggtcccgt 
agcccacctt 
ccctggatcc 
agatcagacc 
tgggcctgcc 
gcaagggtga 
ccttctcagt 
tcgaaagggt 
ctcccctgcg 
cgtacgtgca 
agegcagcat 
gggtgcggct 
aca tcacctt 
ctcccggctg 
ggatcccctt 
aagtcatctg 
cccaggtgct 
tttatcaagg 
ggtgctgcct 
ctcagcaggt 
agggr.ggggt 
cagagggagg 
gggggegarc 
acagggtctg 



gtcgcctggg 
ctgggcagcg 
tgacagagct 
tgcccatgta 
ctacattaac 
agggtacatg 
caagggggag 
gggccgcaag 
ctttgtgaac 
tggcatctac 
cattatgeat 
catgeagage 
cttcaagcgc 
cagcggccac 
gagageccag 
cctgcctcct 
aacttccgct 
ctctgcccct 
tgc taaggee 
ttacaaacac 
ctggggagct 
ggcctggtgt 
cggcctccgc 
tctggcgctc 
accaagtgcc 



60 
120 
180 
240 
30C 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
102 0 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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aggaagacct gtgctataaa ccaccctgcc tgatcctgcc cctgcctgac cccgccacgc 1560 
cctgczgtcc agcatgatta aagaatgctg tctcctcttg gaaaaaaaaa aaaaaaaa 1618 



<210> 83 

<211> 2034 

<212> DMA 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (14) 

<223> n equals a.t,g, 
<220> 

<221> SITE 
<222> (382) 
<223> n equals a,t.g, 

<220> 

<221> SITE 

<222> (1999) 

<223> ri equals a,t,g, 

<220> 

<221> SITE 
<222> (2027) 
<223> n equals 



a,t,g. or c 



<400> 83 

actcactaag 

atcccccggg 

ccggccagcg 

acgtactatc 

gtattctggt 

ggttacttga 

cttctggcag 

cagttcatca 

cttgccaact 

agcagccgga 

gagcccctgc 

cagatcacca 

gtgtacctgc 

gaagagrgta 

atgtttgtgg 

gaggggaatg 

tcagtcctgg 

aatgaggcta 

ccccaggaca 

atgtcctacc 

ttggccttga 

catctcaatc 

cggagaacca 

ggcaatgacc 

ggtcntcgag 

tatagccgat 

gaacacgggc 

cagctggctt 

ccactcccag 



ggancaaagc 

ctgcaggaat 

tccctgagac 

agccccgtgc 

ccatctctta 

gtttgtccaa 

gtgtggcctg 

ccatcttgga 

acaactttga 

aggagtctcg 

accgggggac 

gctacctggt 

tgcagaaggc 

atgggcgccg 

accggcgggg 

ctgggtttta 

gctggaatca 

atgccatgga 

tcatcatcta 

cagatgttag 

aggtcatgcc 

taaacaacgc 

aggatgagat 

tcctgctgaa 

tggtgaggca 

Sffgaggtgga 

ccgacttccc 

tgtttcrggc 

cccagaactt 



tggagctcca 
tcggcacgag 
gccaacggca 
cctggagaaa 
ttactcctct 
agtggtgccg 
cntccgaggc 
agcaacacat 
cttccggagc 
agggggccct 
agcagacacc 
ggcgcacacc 
cctcatgcct 
ggcaaayctg 
gacagccgag 
tgaggtgggc 
tccaggcttt 
tgtggtggtc 
Cgcctggtcc 
tgccatgatc 
agacagctgg 
ggagcagctg 
catcaccacc 
gctccrgcag 
gtggttggag 
affaggactgg 
ctggagcgtg 
tcggaagcat 
ccagatgccc 



ccgcggtggc 

atctaccggg 

gtcactgccc 

catgctgaca 

cccttcgcct 

ttttctcact 

attggccgct 

cggaaccagt 

tggccagtcg 

tcccgccggg 

ctcctcaacc 

ctagggcgcc 

gtgctgctgc 

ctggcccgtg 

ccccagggac 

tgcgtctcca 

gctggaagca 

cagtttgcca 

atcggcggct 

ctggatgcct 

aggggcctgg 

tgcagatacc 

acggttcctg 

catcggtatc 

gcctcctcac 

tgtctgtctg 

ggggaggaca 

ctgcacaact 

Cggcacctct 



ggccgctcta 

agakggaytc 

cccattccag 

gcatcctggc 

tcttctactt 

atgccgggac 

ggaccaaccc 

cttcagaaaa 

acttccactg 

gtgtggccct 

gggttaagaa 

ggatgctgta 

agggccaggc 

atggcaatga 

agaagctggt 

cgcccctgga 

cgggggtgcc 

tccaccgcct 

tcactgccac 

ccttrgatga 

tgaccaggac 

agggtcctgt 

aggacatcat 

cccgggtgat 

agctggagga 

tcctccgctc 

tgagtgcaga 

ttgaggccac 

agggaccaac 



gaactagtgg 

tgaaagggcc 

ctcctgggat 

actggcntca 

gtacaggaaa 

attgctgcta 

ccagtaccgg 

caagaggcag 

ggaagaaccc 

gct-cgccca 

gctgccztgt 

tccaggctct 

ccgactggtg 

gattgacacc 

gatctgctgt 

agctggacat 

attcccgcag 

rggcttccag 

gtgggcagcc 

cctggtgccc 

cgtgaggcag 

actgctgatc 

gtccaaccga 

ggcagaggag 

agcctcaatt 

ctaccacgca 

Cggacggcgg 

tcactgcacc 

tgggactcat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114C 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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tatggaagaa cggggtgaga ggagacatga ggaaagaccc tcttatttgt gattctctgt 
gttcatgttg ctgtttatag tttgrggaaa gtgggggacc atccccct-c tcaccactgt 
tcctcttgca cgtttcccct cattcatgtg gctgtactta accctctcca acatacatcc 
tgcattacat gaatggatta ttcctaataa ttaataaaaa ggtatttttt ctacaaaaaa 
aaaaaaaaaa aaactccana at tact tec t aaaaccggcc cgcgggnccc atcc 



<210> 84 
<211> 2240 
<212> DNA 

<213> Homo sapiens 
<400> 84 



<210> 85 

<211> 1488 

<212> DNA 

<213> Homo sapiens 



60 
120 



-cgaccacgc gtccgcttcc tcatgctgcc cgctcgccta ccgttcaggc tgetgagect 
tttccttcgt ggacccgctc ccacggcagc gcgccatggc ctccgggagc cgctcctgga 

gaggaggtgc gctgccgcct: cctccttcca gcactcatcg agtctgggac gcgagcttcc 180 

ttatgacccc gtggacacgg agggctttgg agaaggtggt gaeatgeagg agegttttet 240 

gttcceggag tacatcctgg^atccggagcc gcaacccacc cgegaaaage agetgeagga 300 

gctccagcaa cagcaggagg aggaggagcg acagaggcag cageggeggg aggageggeg 360 

acagcaaaac ctacgggcca ggccccggga gcacccggtc gtggggcacc cggacccggc 420 

attgccgccc agcggcgtga actgeteggg ctgeggggea gagctgeact gccaggacgc 480 

cggagtgccc ggctacctgc cccgagagaa gttcctccgc aeggeggagg cagaeggegg 540 

getggcaegg accgtgtgcc agegctgetg getgetgteg caccaccggc gcgctctacg 600 

ectgeaggtg agccgcgagc agtacctgga gctggtgagc gccgcgttgc ggcggcccgg 660 

cccctccctg gtgetctaca tggtggacct gctggacctg cccgacgccc tgctgcccga 720 

cttgcccgcg ctggtgggcc ccaagcagct gatcgngctg ggaaacaaag tggacctcct 780 

gccccaggat gctcctggct aceggcagag getgegggag cgactgtggg aggactgtgc 840 

ccgcgccggg ctcctgctgg cccctggcca ccaagggcca cagcgccccg tcaaggacga 900 

gccacaggac ggggagaatc cgaatccgcc gaactggtcc cgcacagtgg teagggaegt 960 

geggctgate agegecaaga ccggctatgg agtggaagag ttgacctctg cccttcagcg 1020 

ctcctggcgc taccgtgggg aegtctaett agtgggcgcc accaacgccg gcaaatccac 1080 

tctctttaac acgctcctgg agtccgatta ctgcactgcc aagggctccg acgccatcga 1140 

cagagccacc atctcccctt ggecaggtae tacattaaac cttctgaagt ttcctatttg 1200 

caacccaact ccttacagaa tgtttaaaag gcatcaaaga c:taaaaaag attcaactca 1260 

agctgaagaa gatcttagtg agcaagaaca aaatcagctt aatgtcctca aaaagcatgg 1320 

ttatgtcgta ggaagagttg gaaggacatt cttgtattca gaagaacaga aggataacat 1380 

cccctttgag tttgatgctg attcacttgc ctttgacatg gaaaatgacc ctgttatgog 1440 

tacacacaaa tccaccaaac aagtagaatt gaetgeacaa gatgtgaaag atgcccactg 1500 

gttttatgac acccctggaa tuacaaaaga aaattgtant ttaaatcttc taacagaaaa 1560 

agaagtaaat attgttttgc caacacagtc cattgttcca agaacctttg tgettaaace 1620 

aggaatggtt ctgtttttgg gtgeratagg cegcatagat ctcctgcagg gaaatcagee 1680 

agcttggttt acagtcgtgg cttccaacat cctccctgtg catatcacct ccttggacag 1740 

ggcagacget ctgtatcaga ageatgeagg teataegtta ctccagattc caatgggtgg 1800 

aaaagaacga atggcaggat ttcctcctct tgttgctgaa gacattatgt taaaagaagg i860 

actgggggca tctgaagcag tggecgacat caagttttcc tetgeaggtt gggtttcagt 1920 

aacacctaat tttaaggaca gaetgeatet ccgaggctat acacctgaag gaacagtttt 1980 
gaccgtccgg ccccctctct tgecatacat tgttaacatc aaaggacagc gcatcaagaa 
aagtgtggrc tataaaacca agaagcctcc ttcccttatg tacaacgtga ggaagaagaa 
aggaaagata aatgtatgag accgaccttg ttcaccccag acattaactg taccgaacac 
aacaaaatac atcgaatttg tattaaacat ataaegcata aacaaagctc ccattcttac 
ccttaaaaaa aaaaaaaaaa 
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cgccaagttt coggagggag agggtagaaa ctggaggggg tggacctgtc ac tcacggga 60 

ctgagggtcc ttttctcccg ctecccaggag gaacgagaat gaatatgacz caageccggg 120 

ttctggtggc tgcagtggtg gggttggtgg ctgtcctgct ctacgcctcc acccacaaga 180 

ttgaggaggg ccat=tggct gtgtactaca ggggaggagc tttaetaact agccccagtg 240 

gaccaggcta tcatatcatg ttgcctttca ttactacgtt cagotctgtg cagacaacac 300 

tacaaaccga tgaagttaaa aatgtgcctt gtggaacaag tggtggggt:c atgatctata 360 

ttgaccgaac agaagtggtt aatatgttgg ctccctatgc agtgtttgat atcgtgagga 420 

actatactgc agattatgac aagaccttaa tcttcaataa aatccaccat gagctgaacc 480 

agttctgcag tgcccacaca cttcaggaag tttacattga attgtttgat caaatagatg 540 

aaaacctgaa gcaagctctg cagaaagact taaacctcat ggccccaggt ctcactatac 600 

aggctgngcg tgecacaaaa cccaaaatcc cagaagccat aagaagaaat tttgagttaa 660 

tggaggctga gaagacaaaa ctccttatag ctgcacagaa acaaaaggtt gtggaaaaag 720 

aagctgagac agagaggaaa aaggcagtta tagaagcaga gaagattgca caagtggcaa 780 

aaattcggtt tcagcagaaa gtgatggaaa aagaaactga aaagcgcatt tctgaaatcg 840 

aagatgctgc attcctggcc cgagayaaag cgaaagcaga tgctgaatat tatgctgcac 900 

acaaatatge cacctcaaac aagcacaagt tgaccccgga atatctggag ctcaaaaagt 960 

accaggccat tgcttctaac ^agtaagatct attttggcag caacatccct aacatgttcg 1020 

tggactcctc atgtgctttg^ aaatattcag atattaggac tggaagagaa agctcactcc 1080 

cctctaagga ggctcttgaa ccctctggag agaacgtcat ccaaaacaaa gagagcacag 1140 
gttgatgcaa gaggtggaaa tgttctccat atcaagatgt ggcccaaggg gttaagtggg * 1200 

aacaatcatt atacggactc ttcagattta cagagaactt acaccccatc tgtcccacct 126C 

ctcctgcgat agtcccgggt gctccacnga ttggaggata gagccagctg tcf.gacacac 1320 

aaatggtctt ttcagccaca gtcttatcaa gtatcctata tgtattcctt tctaaactgc 1380 

zactcatgaa tgaggaaagt ctgatgctaa gatactgcct gcactggaat gttaaacact 1440 

aaatatataa caagctgtgt tctcctaagc tgaaaaaaaa aaaaaaaa 1488 



<210> 86 
<211> 3174 
<212> DNA 

<213> Homo sapiens 



<400> 86 

gcggacgctg grcscaaaca ctaaggcctg agcggtgaca atcgaggcga gatgatggtc 60 

aacagggaat. gcctcgtggg agaaaaaaga caatzttact ctcagcgctg attttgagat 120 

gatgggctcg ggaaacgggc gtcgcagcat gaagtcgccg cccctcgtcc cggccgccct 180 

ggtggcctgc atcatcgtct tgggcttcaa ctiactggatt gcgagctccc ggagcgtgga 240 

cctccagaca cggatcatgg agctggaagg cagggtccgc aggcgggccg cagagagagg 300 

cgccgtggag ctgaagaaga acgagttcca gggagagctg gagaagcagc gggagcagct 360 

tgacaaaatc cagtccagcc acaacttcca gctggagagc gccaacaagc tgzaccagga 420 

cgaaaaggcg gttttggtga ataacatcac cacaggtgag aggctcatcc gagtgctgca 480 

agaccagtta aagaccctgc agaggaatta cggcaggctg cagcaggatg tcctccagtt 540 

tcagaagaac cagaccaacc tggagaggaa gttctcctac gacctgagcc agtgcatcaa 600 

tcagatgaag gaggtgaagg aacagtgtga ggagcgaata gaagaggtca ccaaaaaggg 660 

gaatgaagct gtagcttcca gagacctgag tgaaaacaac gaccagagac agcagctcca 720 

agccctcagt gagcctcagc ccaggctgca ggcagcaggc ctgccacaca cagaggtgcc 780 

acaagggaag ggaaacgtgc ttggtaacag caagtcccag acaccagccc ccagttccga 840 

agtggtt.ttg gattcaaaga gacaagttga gaaagaggaa accaatgaga tccaggtggc 900 

gaatgaggag cctcagaggg acaggctgcc gcaggagcca ggccgggagc aggtggtgga 960 

agacagacct gtaggcggaa gaggcctcgg gggagccgga gaactgggcc agaccccaca 1020 

ggtgcaggct gccctgtyag tgagccagga aaatccagag atggagggcc ctgagcgaga 1080 

ccagcttgtc atccccgacg gacaggagga ggagcaggaa gctgccgggg aagggagaaa 1L40 

ccagcagaaa ctgagaggag aagatgacta caacatggat gaaaatgaag cagaatctga 1200 

gaeagacaag caagcagccc tggcagggaa tgacagaaac atagatgttt ttaatgttga 1260 

agatcagaaa agagacacca taaatttact tgaccagcgt gaaaagcgga atcatacact 1320 

ctgaattgaa ctggaatcac atatttcaca acagggccga agagatgact ataaaatgt-.t-. 1380 

catgagggac tgaataccga aaactgtgaa atgtactaaa taaaatgtac atccgaagat 1440 

gattattgtg aaattttagt a tgcactttg tgtaggaaaa aatggaatgg rcttttaaac 1500 

agcttttggg gggtactttg gaagtgtcta aLciaggtgCc acaacttttg gtagtaggta 1560 
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tttcgtgaga 
cggggcttcc 
ccatctgctg 
accaagtgct 
tgtagtcact 
acaaacgtca 
agaatttttt 
attaagtggc 
aaagcaacat 
gacaaaacaa 
atgagagggc 
aacaaagttc 
aacaggaata 
cacaaaactg 
ttcacaa taa 
aataggctct 
ggctctctaa 
cgtttgatcc 
ctagctggtg 
cacacagtca 
tcttcacatg 
tagataaa&t 
tcctccttgg 
atgtttaatt 
cttgtcacag 
ccaccctcaa 
gggctgtcag 



agrttcaacac 
tgattctgga 
tgaaggagag 
ggtgccagcc 
tctgattctg 
ctagcaaagt 
aaaaggctac 
ttagaatttg 
cttccctaaa 
acaggccaaa 
aagtttgttg 
attgatgtcg 
caatggttgt 
attttaaaat 
caatggcatt 
taccacttgc 

ggggtaagta^ 

aaccctctta 5 
cgctggcacc 
ctgaagcagg 
rgatgtcctt 
cttagagtac 
ggttgatagg 
tgfcacgcata 
ctctgttttt 
actgtggac^ 
gacgtcttca 



caaaactgga 
atataacttt 
acagagaac t 
tgttacctct 
acaatcaatc 
agcaacagct 
ttgtataata 
aaeatttgtg 
tatatattac 
agggggctga 
tggacagatc 
aaggatatat 
ttttatcata 
caagttaatg 
aetggtctgac 
aaataactgg 
,0gaggatcca 
ttttcagagg 
cctggcctca 
ccctgtetgc 
agtcactaag 
tttcatactc 
aattttcrct 
gaattgctga 
ttgcttttcc 
aaggatggca 
acccacagta 



acatagttct 
gtgtaaatta 
ctggg- tccg 
tctcactgaa 
aatcaatggc 
ctaagtctaa 
acccttgcca 
gtctttattt 
caagaaaagc 
cctggagcag 
tgtgcctact 
acagtgttag 
gtgtacacat 
tgaattttga 
ctgagttggt 
ccacatcatt 
caggatttga 
ggaaaatggg 
cacagactcc 
aattcacgtt 
gttaaacttt 
ttctaagtcc 
tgctttctca 
gaaataaaat 
ccttcaccct 
gtttgcctta 
ccccgaaagc 



cctccaagtg 

acagccacct 

tcgccctgtc 

aagtctggct 

ctagagcact 

atacaaagct 

tttttaatgt 

actttgcttc 

aagaagcaga 

agcacggtga 

ttattactgg 

aaattaggac 

ttagcttgtg 

aaattaccac 

tcctagtatt 

aatgactgac 

gatgctaagg 

gcctagaagt 

cgagtagctg 

gccacctcca 

cccacccaga 

tcttccagcc 

ataaagtctc 

gttctgttca 

ctattagatc 

aaccttctgt 

tttgr.gct-.ca 



ttggcgacag 
atagaagagt 
cacgtgctgt 
aatgctcttg 
gactgttaac 
gttctgtgtg 
acaaaacgct 
gtgtgtgggc 
ttaggttttt 
gaggcaaggc 
agtaaaagaa 
tgtttagaaa 
gtaaatgact 
ttaatcctaa 
atttatggta 
ttcccagtaa 
ccccagagat 
tacagagcat 
ggactacagg 
ac^taaacat 
aaaggcaact 
tcactttgag 
tattcatctc 
acttaaaaaa 
cttaaagaat 
ggaatgagat 
ccac 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
27O0 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3174 



<210> 87 
<211> 2780 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (2760) 

<223> n equals a,t,g, or c 



<40O> 87 

ggcgccgcgg 

gtggacactg 

tggaagakgc 

ccgcccaggg 

tgaagtacca 

acgaggagcc 

aagggctgta 

ttatcaagtt 

agctggctct 

agcactatga 

gagatgggaa 

acgtggagtc 

aggccccggt 

aaaataactt 

ggtgtggtca 

ctggtctggc 

gcaagtattc 

gtgagcacag 

aagacgaact 



gcggggggtc 
ccagcggctg 
caaagtctat 
ggtgcgagga 
gggtcctcgg 
agtgacacca 
tgagctctca 
ct-.tcgctccg 
gggccttgaa 
actctgcccc 
aaaggtggat 
gcagctgcag 
gctggcagct 
cgatgacacc 
ctgtaagact 
gggggtcaag 

ggtacgaggc 
tggaggcaga 
ttaggaacac 



ccggggccac 
cagccgactt 
gtggctaaag 
taccccacct 
gacttccaga 
gagccggaag 
gcaagcaact 
tggtgtggtc 
cattccgaaa 
ggaaaccagg 
cagtacaagg 
cgcacagaga 
gagcccgagg 
attgcagaag 
ctggctccta 
atcgccgaag 
taccccacgt 
gaccttgact 
agttggaggt 



gtggacgccg 
ggaatgacct 
tggactgcac 
taaagctttL 
cactggaaaa 
tggaaccgcc 
ttgagctgca 
actgcaaagc 
ctgtcaagat 
tccgtggcta 
gaaagcggga 
ctggagcgac 
ctgacaaggg 
gaataacctt 
cttgggagga 
tagactgcac 
tattgctttt 
cgttacaccg 
cacctctccc 



acggctctgg 
gggagacaaa 
ggcccactcc 
caagccaggc 
ctggatgctg 
cagtgccccc 
cgttgcacaa 
cctggctcca 
tggcaaggtt 
tcccactctt 
tttggagtca 
ggagaccgtc 
cactgtgttg 
catcaagttt 
actctctaaa 
tgctgaacrgg 
ccgaggaggg 
ctttgccctg 
gcccagctcc 



tctgcggcgt 
tacaacagca 
gacgtgtgct 
caagaagctg 
cagacactga 
gagctcaagc 
ggcgaccact 
acctgggagc 
gatcgtacac 
ccczggrtcc 
ctgagggagt 
acgccctcag 
gcactcactg 
tatgctccat 
aaggaatccc 
aaratctgca 
aagaaagtca 
agccaagcga 
cgcaccctgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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gtttaggagr tcagtcccac agaggccact gggttcccag tggtggctgt tcagaaagca 1200 
gaacatacta agcgtgaggt atcttctttg tgngtgtgtt ttccaagcca acacactcta 1260 

cagattcttt attaagttaa gtttctctaa gtaaatgtgt aactcatggt cactgtgtaa 1320 

acattttcag tggcgatata tcccctttga ccttctcttg atgaaattta catgctttcc 1380 

tttgagacta aaatagcgct gagggaaatg aaattgctgg actatttgtg gctcctgagt 1440 

tgagtgattt tggtgaaaga aagcacatcc aaagcatagt ttacctgccc acgagttctg 1500 

gaaaggtggc cttgtggcag tattgacgtt cctcrgatct taaggtcaca gttgactcaa 1560 

-actgtgttg gtccgtagca tggagcagat tgaaatgcaa aaacccacac ctctggaaga 1620 

taccttcacg gccgctgctg gagcttctgt tgctgtgaat acttctctca gtg-gagagg 1680 

ttagccgtga tgaaagcagc gttacttctg accgtgcctg agtaagagaa tgctgatgcc 1740 

ataactttat gtgtcgatac ttgtcaaatc agttactgtt caggggatcc ttctgtttct 1800 

cacggggtga aacatgtctt tagctcctca tgttaacacg aagccagagc ccacatgaac i860 

tgttggatgt cttccttaga aagggtaggc atggaaaatt ccacgaggct cattctcagt 1920 

atctcattaa ctcattgaaa gattccagtt gtatttgtca cctggggtga caagaccaga 1980 

caggctttcc caggcctggg tatccaggga ggctctgcag ccctgctgaa gggccctaac 2040 

tagagctcta gagtttctga ttctgtttct cagtagtcct tttagaggct tgctatactt 2100 

ggtctgcttc aaggaggtcg acc*tctaat gtatgaagaa tgggatgcat ttgatctcaa 2160 

gaccaaagac agatgtcagt gggctgctct ggccctggtg tgcacggctg tggcagctgt 2220 

tgatgccagt gtcctctaoc tcatgctgtc cztgcgatta aacacctcta Lctcccttgg 22*80 

gaataagcac atacaggctt aagctccaag ataggtgttt gtccttttac catcgagcta 2340 

cttcccataa taaccacttt gcatccaaca ctcttcaccc acctcccata cgcaagggga 2400 

tgtggatact tggcccaaag taactggtgg taggaatctt agaaacaaga ccacttatac 2460 

tgtctgtctg aggcagaaga taacagcagc atctcgacca gcctctgcct taaaggaaat 2520 

cttta-taat cacgtatggt tcacagataa ttctttcttt aaaaaaaccc aacctcctag 2580 

agaagcacaa ctgtcaagag tcttgtacac acaacttcag ctttgcatca cgagtcttgt 2640 

attccaagaa aatcaaagtg gtacaatttg tttgtttaca ccatgatact ttctaaataa 2700 

actctttttt tttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaggggg gggggggggn 2760 

ggggcccccc cccccccaaa 2780 



<210> 88 
<211> 1061 
<212> DNA 

<213> Homo sapiens- 
<400> 88 

aattcggcac gagagaagga aatacatcaa aa tgcccaca ttggttatct gtagaggata 
gaatgaaaga tggctttatt ctcttgtttg ctctcattaa agcaatcaga tggtgcttct 
cctgtactca gagccctggc tgcttcctgc ctggcctctc c.gcgggcty ctgtgcaacc 
agaaaagcct taaacggaaa tgtgggagag aaggctggat tcactttcat gtctttccag 
ggttgtgacc cctcaagtcc tggttgcctt tgctgttctc tattaccttc aaacagccag Jyu 
ctcgtcttta tttctttttt agtttngtcg gggctggctt gatagatgtt agtccatcat 360 
agccagatgt gtctagcctt gtcttttgaa tgcaagattt aggatgtggg tacttagctg 420 
ttagcggaca tcagogtcac tagtcaggat gaaagagttc ttggctttaa cccccagaaa 
ttctggtaac gtcatgtata gtgacggccg catgtctaac aggtggccag gtaagtcttt 
tggggtggtc tgtgaatcac agtttgggag acattgactt ttagggagtc tgttctgaat 
tcactagata atagagatat aatacagagc tttgaaagct ggtgtcttga tgacagagcc 
gtggcaatgg ggagggttga ggaggtggct gttgggcctg tctcccggtg agagttgaaa 720 
gggcctgaac tcaagcagag gcctcagaac cgaaaggtgg tggaaggatg cagcaagagg 780 
cgccacacag gagnactctg cgccccggca gggtctgaat acacgtggga gtggtgagag 
ggagaacttt aagtccaggc tttgtgcctc agtgacttag tgtggccata tcattagaaa 
tgtgttgagg ccgggcacag tggctcatgt ctgtaatccc agcactttga gaggctgagg 
caggaggatg gcttgaggcc aggagtttaa aaccagcctg gacaacazag tgagagcctg 
tctctacaaa aaaaaaaaaa aaaaactcga gggggggccc g 



60 
120 
180 
240 
300 



480 
540 
600 
660 



840 
900 
960 
1020 
1051 



<210> 89 
<211> 1342 
<212> UNA 
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•*213> Homo sapiens 
<400> 89 

ccacgcgtcc gggcgcgcgg gtgaaaggcg catfcgatgca gcctgcggcg gcctcggagc 
gcggcggagc agacgctgac cacgttcctc tcctcggtct cctccgcctc cagctccgcg 
ctgcccggca gccgggagcc atgcgacccc agggccccgc cgcctccccg cagcggctcc 
gcggcctcct gctgctcctg ctgctgcagc tgcccgcgcc gtcgagcgcc tctgagatcc 

ccaaggggaa gcaaaaggcg cagctccggc agagggaggt ggtggacctg tataatggaa juu 

tgtgcttaca agggccagca ggagtgcctg gtcgagacgg gagccctggg gccaatggca 360 

ttccgggtac acctgggatc ccaggtcggg atggattcaa aggagaaaag ggggaatgtc 420 

tgagggaaag ctttgaggac tcctggacac ccaactacaa gcagtgttca tggagttcat 480 

tgaattatgg catagatctt gggaaaattg cggagtgtac acttacaaag atgcgttcaa 540 

atagtgctct aagagztt-g ttcagtggct cacttcggct aaaatgcaga aatgcatgct 600 

gtgagcgttg gtattccaca ttcaatggag ctgaatgttc aggacctctt cccattgaog 660 

ctataattta tttggaccaa ggaagccctg aaatgaaLLc aacaattaat attcatcgca 720 

cttcttctgt ggaaggactt tgtgaaggaa ttggtgctgg attagtggat gttgctatct 780 

gggfctggcac ttgttcagat tacccaaaag gagacgcttc tactggatgg aattcagttt 840 

ctcgcatcat tattgaagaa ctaccaaaat aaatgcttta attttcattt gctacctctt 900 

tttttattat gccttggaat ggttcactta aatgacattt zaaataagtt tatgtataca *960 

tctgaatgaa aagcaaagcr aaatatgttt: acagaccaaa gtgtgatttc acactgtttt 1020 

caaatctagc attattcatt ttgcttcaat caaaagtggt ttcaatattt nttttagttg 1080 

gttagaatac tttcttcata gt:c:acattct ctcaacctat aatttggaat: attgttgtgg 1140 

tcttttgttt tttctcttag tatagcattt ttaaaaaaat ataaaagcta ccaatctttg 1200 

tacaatttgt aaatgttaag aattrttttt atatctctta aataaaaatt atttccaaaa 1260 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1320 

aaaaaaaaaa aaaaaaaaaa aa 1342 



<210> 90 

<211> 770 

<212> DNA 

<213> Homo sapiens 

<22C> 

<221> SITE 
<222> (690) 

<22 3> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (762) 

<223> n equals a,t,g, or c 



<400> 90 

ggtagggtcg ggkcgaccyr cgcgtccggc tcttcctcag gcggcggcca tggcgggaca 
ggaggatccg gtgcagcggg agattcacca ggactgggct aaccgggagt acattgagat 120 
aatcaccagc agcatcaaga aaatcgcaga ctttctcaac tcgttcgata tgtcttgtcg iao 
ttcaagactt gcaacactaa acgagaaatt gacagccctt gaacggagaa tagagtacat 
tgaagctcgg gtgacaaaag gtgagacact cacctagaac agtgccgtgc tgctgctggg 
aagttgcttt acacaacaca ggccacatgg gaaaggcccc agcagccttc agctc=ttcc 
ttccccctta aagagcaaca gggcttattc ttgtttttct tttttcaaaa gtgtggcctt 
tgggctctgc catctggggt gtggtgtggt atgtgggaag aagttcagag aaaccgctgg 
aaacgacgtt aggcatttta ccttttcagt aacattttat acatctactt grcaatgtat 
ttgagacatt cacagccaaa agcctgggac tctttgtgaa ggccctcctc amctctatct 
ttctttctcc ctctctcaaa ctttccttaa agttctcatt gcctttgcac tgcttctgtg 
aacagtcttt gtctcctccc caccttrggn ggaaagtgcg gggcaqtcct ggtcaagaca 720 
ctcatgccct ggcaatgtgg cctgcagaga atgttgttgt anccaccagt 770 



60 
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300 
360 
420 
480 
540 
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<210> 91 

<211> 1570 

<212> DNA 

<213>- Homo sapiens 



<400> 91 

gatggtttta 

atctacaatc 

gatgact-tc 

aaaacctctt 

atcatgracat 

aacttctccc 

catctacacg 

ttgagcttgg 

tctacactgt 

tctctatcaa 

ggaagcattc 

gccaccttaa 

agttatgttt 

gccagtctgg 

gctgtgctca 

cttttcccgc 

tgtggatatg 

tgattcttrat 

aaatgtaagc 

atatgtgttc 

ctgtaaatta 

atgatcattc 

attcaatgga 

caggagttaa 

cctatgtatt 

atatgaaagt 

aaaaaaaaaa 



ctgaagctgc 
tgtgagtatc 
agaagatgaa 
tggagcttct 
cacaocctg: 
aagcagagaa 
cagaaatcaa 
ggatcatttt 
tgaactctgc 
tcgccacaga 
tgagtgctct 
atcctgcctc 
cttactttt-.a 
ctggawctct 
cLgctgugct 
ctcacagtta 
aagaactat- 
gataatatgg 
ttttaaagta 
attggggatc 

ccatttacta 
ccagaaacat 
tcacagtgaa 
ctgatttgtt 
ttaactcatt 
agctggctcc 



taaaagttta 
acatcctgta 
ggtaagtaga 
gaggactcag 
tcccaatgag 
acccgaaccc 
agttattggg 
ggcatctgct 
ttacccattc 
gaaaaggttr 
gtctgccctg 
actgcagtgt 
tcatgattca 
ctctctgatg 
gcggtggaaa 
cattggtaat 
gacttcttaa 
aaaagttaac 
atgaacatta 
tcttgatttg 
gattagccaa 
ttcccagaaa 
gcaaaggaca 

gtggtgctca 

agtgcataaa 
aaaataaaat 



cagaaaagga 
tagctgtmaa 
aaccgttgat 
ctggaaccaa 
accatcacag 
accaaccagg 
actatccaga 
tccttctctc 
ataggaccct 
accaagcttt 
gtgggtttca 
gagttggaca 
ctttatacca 
ctgatttgca 
caggcctact 
tctggcatgt 
gaaaaaaggg 
cattatagaa 
aaaaaaacca 
cctgacattg 
atagtctgaa 
atgtttccca 
caacttrtta 
gactttttta 
ggcaagcccc 
tttcgagtgc 



agtgcaggaa 
cactggaata 
gggactgaga 
cgggcacagt 
tgctcccatc 
ggcaggatag 
tcttgtgtgg 
caaactttac 
ttttttttat 
tggcgcatag 
ttatcctgtc 
aaaataatat 
cggactgcta 
ctctgctgga 
ctgacttccc 
cctcaaaaat 
agaaatacra 
aagcaaagct 
ttatttcact 
acttcagcaa 
tttccagaaa 
gaaaactaga 
ttgtacccct 
tacaggtgct 
atataatgaa 
aaaaaaaaaa 



catttcacaa 
aggaagggct 
aaccagagtt 
tggcaacacc 
aaacgtcatc 
cctgaagaaa 
catgatggta 
ccaagtgact 
catctcLygc 
cagcctggtt 
tgtcaaacag 
accaacaaga 
tacagccaaa 
at-ctgccta 
tgggagtgta 
gacccatgac 
accagaaagt 
tgagtttcct 
gtcactcaag 
aagcacgggg 
acaaggcaga 
cagmatgatc 
taattgtcaa 
agtgttttat 
gtctcagggt 
aaaaataaaa 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 

- 780 

- 840 
900 
960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1570 



<210> 92 

<211> 2950 

<212> DNA 

<213> Homo sapiens 



<40O> 92 

cccggctccc 

cagtctgccc 

ccgggcgcgc 

tgcfcgctcgg 

gactctatcc 

gtgtgcgggc 

tgcttttctg 

tcgaggagcc 

cccagcagcc 

ggaattccac 

gcccgccccc 

ccaagtagtg 

tctgtcttcg 

ctgatatcct 

actcacactt 

gtgtggtgac 

agagactgag 

gagcatccca 



gcccgctccc 
gcccgccccg 
tggagaggac 
gctgctcttg 
aaccnactat 
cctccccata 
ctgcggagcc 
agccttcaat 

ggggccgccc 
ggcaatggct 
tccagcctac 
gggtgcccac 
ttgatgttca 
cacagcaagc 
tgtcccctct 
agtccccgag 
aggaagaagg 
ggcaagcatic: 



agccgggccc 
cgcgggggcc 
gcgaggagcc 
gagtgcacag 
atatgccgct 
cagaggctgt 
ggcttcttca 
gtgtcctaca 
tattacacyg 
ttccaggtcc 
tgcaacacgc 
gtgcaagagg 
cttccaggaa 
acagctctcx 
gttgcttctg 
ggctgacgtc 
cagtgctgga 
cctctgcccg 



cccagcggtc 
gagtcgcgaa 
atgaggcgcc 
aagccaaaaa 
cctacgagga 
ggtacttctg 
tccggaggcg 
ccaggcagcc 
acccaggagg 
cacccaactc 
ctccgccccc 
agagacagga 
cggtctcgtg 
ttcaggcttt 
tttctgacgc 
cttacggtgg 
ggtgcaggtg 
ggtattaata 



ggcgggacgg 
gcgcgcctgc 
agctgcgaag 
gcattgctgg 
ctgctgtggc 
gctccttctg 
ratgtacccc 
cccaaatccc 
accggggang 
accccagggg 
gtacgaacag 
gagggccttt 
ggctgctaag 
ccatggagta 
atctgtgctc 
cgtgaccaga 
gcatgtagag 
ggaagcccca 



ctcccggctg 
gacccggcgt 
gcggcggcgc 
tatttcgaag 
tccaggtgct 
atgatgggcg 
ccgccgctga 
ggcccaggag 
aaccctgtcg 
agtgtggcct 
gtagtgaagg 
ccctggcctt 
ggcagttcct 
caatatatga 
tcacatggta 
tctacaggag 
gggccaggcc 
-gccgggcgg 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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ctcagccgat gaagcagcag ccgactgagc 
tcctcccgtc agccttcccc ttccagaagc 
cccttgtcat gtttctgtct ctgttcatat 
gaaagggtag cacgtgcagc tctcaccgca 
gcctgacagt gatctgacat ccactaagca 
ctgccccaaa ctgagacatt gcattttgtg 
ttacccattt ttttggtgtg tttatggaag 
agactgggtg tgtgtcttcc ctggacatca 
ggtctccttc cctcaggcag ctccagtggt 
acatctggct gggaegtcac atggactctt 
ctctgaggtt gtgctgggtc taagcgggtg 
tgggaaaaga caggagaagt actgactcaa 
aaagaagaag tggtcggaaa tgcacattcc 
tcacaggtag tctcctgagt agttgacggc 
tagtgttgat gtgtgaacgc tgacctgLcjc 
ggcgcctaga ttactagatg tgctgtatca 
aatgaactaa tea gage etc jztgagaaatt 
ctcatggaag tggatacgga gtgatttggt 
fcggagaacc tcctggggaa ttttgtggga 
aatgcagttg caagcacaga tgctgccacc 
gactgccagc gtggtacctc ccatgctgca 
aatgttcacc gtttacaacc aagacaactg 
tcatttgttt tgcattttta atgtctttat 
ccc tggaatc gggtgcagct gaataggcac 
tttttgatag caaattatgt taagagacag 
ccatgtctgt gtgagacaga gtttgttttc 
gaacgctgat cctgcatatg gaagtcccrc 
ccattgtgtg gatggcgggt tgtgcccact 
aattcccgga gcgtccgtgg ttcagagtaa 
ctgcaacaat tggctcgttt ctcttttttg 
tgtgcaaaat aaaaataaat ttgggcaaaa 
aaaaaaaaag 



tgagcccagc aggtcatctg ctccagcctg 
tgttggagag acattcagga gagagcaagc 
cctaaagata gacttctcct gcaccgccag 
gatggggcct agaaecaggc ttgcttggag 
aatttattta aattcatggg aaatcacctc 
agctcttggt ctgatttgga gaaaggactg 
tgcatgtaga gcgtcctgcc ctttgaaatc 
ctgcctctcc agggcattct caggcccggg 
gggtcctgaa gggtgctttc aaaacggggc 
ccagggagag agaccagccg aggcgtctct 
tgtgctgggc tccaaggagg aggagcttgc 
ctgcactgac catgttgtca taattagaat 
tggataggaa tcacagctca ccccaggatc 
tagcggggag ctagttccgc cgcatagtta 
tgtgtgctaa gagctatgca gcttagctga 
cggggaatga ggtgggggtg ctzatttttt 
gttactcatt gaactggagc atcaagacat 
gtccatgctt ttcactctga ggacatttaa 
gacacttggg aacaoaacag acaccctggg 
agrgtctctg accaccctgg tgtgactgct 
ggcctccatc taaatgagac aacaaagcac 
cgtgggtcca aacactcctc ttcctccagg 
tttttgtaat gaaaaagcac actaagctgc 
ccaaaagtcc gtgactaaat ttcgtttgtc 
tgatggctag ggctcaacaa ttttgtattc 
ccttgaactt ggttagaatt gtgctactgt 
ttcggtgaca tttcctggcc attcttgttt 
tcctggagtg agacagctcc tggtgtgtag 
acttgaagca gatctgtgca tgcttttcct 
ttctcttttg acaggatcct gtttcctatg 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



<210> 93 
<211> 1722 
<212> DNA 

<213> Homo sapiens 
<40O> 93 

ggcacgagcc agagcaggct gctaggcctg 
agtgtgctgg gtcactggga acttcctgaa 
tggggtgcgg gcagcaccgc aggaagagga 
aggctctgtg attggctgcg gcacgatgac 
gggccgggcc cgggggacag aatccgcccc 
gagctggcag accyaggagg tgcgacagac 
cgggagcgcg ggcgcaaagg caccagggcc 
gggttccgcg ggccttcggg tgcgcgtctc 
gatcctgggc ctggtgctgt gcctggtggg 
gcccatgtgg caggtgaccg ccttcctgga 
gaaggggctg tggatgtcgt gcgtggtgca 
cgactcggtg ctggctctga gcaccgaggt 
cgtgctgctg gcgttcgttg cgctcttcgt 
cgtggccccg ggcccggcca argcgcgtgt 
ttgcgggccg ctggcgcxcg tgccactctg 
ttacgacccg nctgtgcccg tgtcgcagaa 
ctgggcggcc accgcgctgc tcatggtagg 
ctgcaccggc cgtcccgacc tcagcttccc 
ggccaccggc gactacgaca agaagaacta 



gggccaccac tgcccctggg tgctacaccc 
gtgctgtcac ctgaactggg cccccaagga 
gcagcccctg tgaagattga gagctgccag 
ccgcgcacgg attggctgct tcgggccggg 
cgaaccttca aagagggtac cccccggcag 
ccgcggggca aacggactgg ggccaagagc 
cgcccagggc gccgcgcagc acggccctgg 
gcctctagcc atggggtccg cogcgttgga 
ctgggggggt ctgatcctgg cgtgcgggct 
ccacaacatc gtgacggcgc agaccacctg 
gagcacsggg cacatgcagt gcaaagtgta 
gcaggcggcg cgggcgctca ccgtgagcgc 
gaccctggcg ggcgcgcagt gcaccacctg 
ggccctcacg ggaggcgtgc tctacctgtt 
ctggttcgcc aacattgtcg tccgcgagtt 
gtacgagctg ggcgcagcgc tgtacatcgg 
cggctgcctc ttgtgctgcg gcgcctgggt 
cgcgaag-ac tcagcgccgc ggcggcccac 
cg'-cigaggg cgctgggcac ggccgggccc 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
*2220 
'2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2950 
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ctcctgscag ccacgcctgc gaggcgttgg ataagcctgg ggagccccgc atggaccgcg 1200 

gcttccgccg ggtagcgcgg cgcgcaggct cctcggaacg tccggctctg cgccccgacg 1260 

cggctcctgg acccgctcct gcetgcgccc gcagctgacc ttctcctgcc actagcccgg 1320 

ccctgccctr. aacagacgga angaagtttc cntttctgtg cgcggcgctg Lttccatagg 1380 

cagagcgggt gtcagactga ggatttcgct tcccctccaa gacgctgggg gtc~tggctg 1440 

ctgcctracc tcccagaggc tcctgctgac ttcggagggg cggatgcaga gcccagggcc 1500 

cccaccggaa gatgtgtaca gctggtcctt actccatcgg cagggcccga gcccagggac 1560 

cagtgacttg gcctggacct cccggtccca ctccagcatc tccccaggca aggcttgtgg 1620 

gcaccggagc ttgagagagg gcgggagtgg gaaggctaag aatctgctta gtaaatggtt 1680 

ngaactctca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1722 



<210> 94 

<211> 635 

<212> DNA 

^21 3 > Homo sapiens 

<400> 94 

gcctaaagag agctccccca ggaccagccc tggccaaggg atzgctgcag ccctcatcca 6{J 

cctrccaagc actggaaaca aacattggag accaagtgag gcgtcactca acacccgtag 12G 

taatcaggga aatgacaagt tacatactga tatcctttgt tttgctgatt ggagttgggt 180 

gcattgeaaa agatcagtcg tgcccagtgt ttgggggaag gaagcgtctt cacctgttgt 240 

ttgtggcagg acagttgagg caggtgagga tgctgagagg tgagctcagc tgtgcctgtt 300 

accgtccaca tgtgcaagcc cttcagctcg gtggttgtac ttgtttttga gatgcagttt 360 

cactcttgtc acccaggctg gagtgcatgg catgatcttg gctcgctgca acatccgcct 420 

cccgggttca agcgattctc ctgtctctac taaaaataca aaaaCtagct gggtgtggtg 480 

gtgcgngcct ttaatcccag ctactcagaa ggctgaggtg caagaattgc ttgaacctgg 540 

gaggtggagg ttgccgtggg ccgagatcac gccaccgcac tccagcctag gcaacagagc 600 

tagactgtct caaaaaaaaa aaaatgaccc tcgag 635 



<210> 95 

<211> 3798 

<212> DNA 

<2L3> Homo sapiens 

<400> 95 

ccacgcgtcc ggggcttcac acaggaaatc nactgctgtg tcaagttcca gagaaaagct 
tctgttcgtc caagttacta accaggccaa accacataga cgtgaaggaa ggggctagaa 
ggaagggagt gccccactgt tgatggggta agaggatcct gtactgagaa gttgaccaga 
gagggtctca ccatgcgcac agttccttct gtaccagtgt ggaggaaaag tactgagrga 
agggcagaaa aagagaaaac agaaatgctc tgccctzgga gaactgctaa cctagggcta 
ctgttgattt tgactatctt cttagtggcc gaagcggagg gtgccgccca accaaacaac 
tcattaatgc tgcaaactag caaggagaat catgccctag cttcaagcag tttatgtatg 
gatgaaaaac agattacaca gaactactcg aaagtactcg cagaagttaa cacttcatgg 
cctgtaaaga tggctacaaa tgctgtgctt cgttgccctc ctatcgcant aagaaatttg 
atcataazaa catgggaaat aatcctgaga ggccagcctt cctgcacaaa agcccacaag 
aaagaaacaa atgagaccaa ggaaaccaac tgtactgatg agagaataac cLggytcLcc 
agacctgatc agaattcgga cctccagatt cgtaccgtgg ccaccactca tgacgggtat 
cacagatgca taatggtaac acctgatggg aatttccatc gtggatatca cctccaagtg 
ttagttacac ctgaagtgac cctgt-tcaa aacaggaata gaactgcagt atgcaaggca 
gttgcaggga agccacctgc gcatatctcc tggawcccag agggcgattg tgccactaag 
caagaatact ggagcaaigg cacagtgact gttaagagta catgccactg ggaggtccac 
aatgtgccta ccgtgaaccg ccacgtctcc cattcgactg gcaacaagag tctgtacata 
gagctacttc c-gttccagg tgccaaaaaa tcagcaaaat tatatattcc atatatcatc 
cttactatca ttattttgac catcgtggga ttcatttggt rgt-gaaagt saatggctgc 
agaaaatata aattgaataa aacagaatct actccagtzg ztgaggagga tgaaatgcag 
ccctatgcca gctacacaga gaagaacaat cctctctang atactacaaa caagccgaag 
gcatctgagg cattacaaag tgaagttgac acagacctcc atactttata agttcttcga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ctctagtacc aagaaacaac aacaaacgag atacattata attactgtct gattttctta 1380 

cagttctaga atgaagactt atattgaaat taggttttcc aaggttctta gaagacattt 1440 

taatggatrc tcattcatac ccttgtataa ttggaatttt tgattcttag ctgctaccag 1500 

ctagttctct gaagaactga tgttattaca aagaaaatac atgcccatga ccaaatattc 1560 

aaattgtgca ggacagtaaa taatgaaaac caaatttcct caagaaataa ctgaagaagg 1620 

agcaagtgtg aacagtttct tgtgtatcct ttcagaatat tttaatgtac atatgacatg 1680 

tgtatatgcc tatggtatat gtgtcaactt atgtgtcccc ttacatatac atgcacatat 1740 

ctttgtcaag gcaccagtgg gaacaataca ctgcattact gttctataca tatgaaaacc 1800 

taataatata agtcttagag atcattttat atcatgacaa gtagagctac ctcartcttt 1860 

ttaatggtta tataaaattc cattgtacag Ctacatcatt atttaattaa aaacaaccct 1920 

aacgacggat atttagattc ttttaagttt tgtttatttc ttttaagttt tgtttgtggt 1980 

ataaacaata ccacatagaa tgtttcttgt gcatatatct ctttgttttt gagtatatct 2040 

gtaggataac tttcttgagt ggaattgtca ggtcaaaggg tttgtgcatt ttactattga 2100 

tatatatgtt aaattgtgtc aaatatatat gtcaaattcc ctccaacatt gtttaaatgt 2160 

gcctttccct aaatttctat tttaataact gtactattcc tgcttctaca gttgccactt 2220 

tctcttttta atcaaccaga ttaaatatga tgtgagatta taataagaat tatactattt 2280 

aataaaaatg gatttatatt tttggtcatg tttgtaagag agtgaatgca cgtgtgagaa 2340 

cattagcttc ttctgaactc attatatctc cacagaggtg ttgatacttg atgcctaaca 2400 

g~tttgcaga tgtgctacat tggaattgtg tatttttatg gtgtacattc tattgtgata" 2460 

tatttattga ataattaatg tctattgacc atataagtgg cgaaaaatgc accatagagg' 2520 

acatggggta tttatttaca aactatgagc tacataataa gcaagtggcc atgggatggc 2580 

a-gaccctcc cctccatatt tttgtggagc aaaatattgg caatgtttat gtaaatcatc 2640 

gttaatatco tgaaattatt tttaattaaa aacataagtc fcatttgctcc atagcagaaa 27 00 

aaacatgaga agtxttttca tcacgacaga oattgaaaca aaatatattc attcttcaat 27 60 

cataccatct gagattttta agacagctat tttgtcttat aagtatattt ttctccctct 2820 

agacatttca gttaczatgg attttgtcct caaagggacc tttagtctat tttggatgta 2880 

aagctaatct aatgacactt ggcacatgat attttgatca agccattttg acttgaccaa 2940 

aaagcagtgt ccattaggtt tctgcatata aatattacca agcaatgttc acaatagaca 3 000 

ccattacacT; gtcctxgaaa tttattaatt cttcatccaa ccctggttga gctgaggctc 3060 

atagttaggt tcaagactat ctgtttaaat attactgaaa aacaaagtaa gacagtacta 3120 

tgcttacctc ttaacttgat aatgtcaaaa caggcatgtt aaatgacatc atagaaaaga 3180 

cttcaagata atttatagaa gttaaattat attgtacaga aaataattgt atgaaaatct 3240 

ctactatggg gctggaacat ggttgaacat tagaatgata taaaaaatta tatatattct 3300 

ccaaatccac gctagacccg tcaaattaga gaatctagag attagacctg gcgtgzcagc 3360 

aaggtcatcc aggaagcaga ggctgagacg gagttaggtg tgattactta catagtcgat 3420 

tacattttac aaataacatt ccatatgtct catttactgt gctttctccc caccccattt 3480 

tgtatctttt cctttgcttt gctagatttg tcaattttct ctctctttct gtctctctct 3540 

ctttcaatat ctctaataat ttgaaagtaa ttcatcataa ccaaatatict attggggtta 3600 

tgctccactc acaaacttcc gaaaacggct ttactgagat ataattgata tatttaagtg 3560 

tacagtttgt taaattttgc acatatttaa aatgtggact ttggtaaatg ttgacatagt 3720 

tttacatctg cgaaaccatc agcataatca agataataaa cttgtccatc accccccaaa 3780 

aaaaaaaaaa aaaaaaaa 3798 



<210> 96 
<211> 2683 
<212> DNA 

^213> Homo sapiens 
<220> 

<221> SITE 
<222> (2640) 

<223> n equals a t ^,g, or c 
<220> 

<221> SITE 
<222> (2676} 

<223> n equals a.t.g, ox c 
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<40O> 96 
acaasgtmac 
gcatttgcaa 
tgttgcgggg 
gatcccgttt 
tgcactccct 
tgtatgtatt 
aatcagttgc 
ttatcaacta 
ctgagaatat 
gacctatgaa 
tgtatttaaa 
cgtaagatct 
ttgtgcagct 
tcctaaaaac 
tgataaacag 
gagtatacct 
gtatgacata 
tcataatgga 
gggaaggaaa 
tcaggtgr.gg 
ggcataatga 
gagttgatga 
aaatgatcaa 
attaatggtt 
ccactaaatc 
gtgtgtgtat 
ttcccctacg 
caataaaaaa 
tgaatctggc 
ataatttgtg 
tgatttttac 
tagttttctt 
ctttgtgtat 
tactttcttc 
agagaagaac 
tgtccacagg 
gagtttgttt 
ctgttggtct 
gtcctgtact 
ttttttctta 
caaatagcag 
tagtaggtat 
acatctgtat 
tttaattact 
ccatgccata 



gcctgacagg 
taacagaaaa 

^gggggtcgg 
gtgtgtgcat 
gtgtattttc 
tttctgtcat 
ttgcttcttt 
ccccaattgc 
gttttagaga 
taatacacct 
ccaagtctgt 
tcacaccaaa 
gacagargta 
aaattgtttg 
ttgaaattca 
attttcttag 
ggaaagtcat 
acatgtaaat 
agtggt-cta 
aaaatatttt 
ctgctggttt 
ggaccacctt 
ctttaatcca 
ctcatgactt 
gttgacttga 
atatgtattc 
catgtgtatc 
aaaaccaaaa 
attatagtga 
tctaaatttc 
attgctttag 
cctgtttttt 
gatgctgaat 
agttgtagaa 
aggtaatagg 
agaaactaac 
ccatcagttg 
catggtcaat 
atgtctgtaa 
gcctacgctc 
ttctcagaat 
gcatatgttt 
gcacacat.ct 
tcgtgagtg^ 
cttggttgga 



tmaccggatc 
ggaattgcat 
ff9gatagttt 
attttatcca 
aggacatggc 
cactgttccc 
caaagtgctt 
atgttctcca 
tattggaata 
tagtctagtt 
gaatacctgc 
gtaggaagta 
atgttacatc 
gaaagtacaa 
agatgtgttt 
cg^tatctgcc 
ttTtttitttag 
atactgaaaa 
ctaatgttcc 
ttagttgatt 
tccagactgg 
tgtgtataca 
ggtagaattc 
^tgtggcatc 
attttgggaa 
ataaacaagt 
ccacacacaa 
cagattgtca 
gcaaatgtcg 
tgttggggta 
gaaacatttt 
catgtgtcat 
tacaaaccgt 
aaggtcaatt 
gagaaataaa 
cattcagttg 
ttgactttcc 
atcaatcaga 
ctactaagtt 
actccccaac 
ctagtcaagt 
gtttctgtac 
ttgataaaat 
tattggatac 
ctattttggg 



cgggaattcc 
gtatgaagtt 
gatttccatt 
gccttaagtt 
tcgtgggtgr 
tctcctcccg 
tgaaggtctt 
Ccacatattc 
aagctgtctg 
ctttactcca 
tttttttggc 
aaaatagctt 
acctaaaaaa 
tgcaccacat 
gacccctagt 
ttgtttatct 
aattcatgga 
ctgtttttca 
aaaatcctca 
gagaatgcag 
attttcctac 
cttgtagttt 
aagatggctg 
taaaaataat 
aaaaaaaagt 
gtgtttgagt 
atggctgagt 
gttaaccagg 
tartaattta 
gaaattacta 
tactaattca 
cgtitagtggt 
ttgaatgatc 
tccctcagtg 
catacggtgg 
tcttaatttt 
aaaatgttgc 
ctttcatgat 
taaagaaaag 
ccatcccaac 
tgccatcatc 
agtactgcgt 
agctattLya 
atcttaaaaa 
gcattnaaaa 



cgggtcgacc 
ttcaatcgtg 
ttctgaaaac 
ataaagctca 
gtgtgttcat 
agtgtgcatt 
gaactcatgt 
tcttatttgc 
ggLaaggagc 
aatctggatt 
cacagagtaa 
agaaagctct 
gaaagataca 
Utttgtagaa 
catttttact 
ttttcttcac 
tcagtctgat 
ggagagaaat 
tcagagaagg 
gtttaacaga 
cgcaarrtatt 
taaaccttgc 
tacttcagtt 
gcctttatag 
tggtgttgat 
aacaagtgag 
tatagtcata 
aaacagttaa 
ggctaatttc 
aaattgtggg 
gctgtcttag 
ctcagaattc 
cagttgaaaa 
tcccacatta 
tttcagtzggt 
agttcgttct 
attaagtaat 
ctctactaat 
cacatagtca 
attgacatgc 
ccccttgcct 
gtgtgtgtgc 
ctagcagggt 
aaaaaaatct 
ttg 



cacggcgtcc 
ggcttttctt 
gacagacttg 
tctgtcccgc 
tgtgtgcgtc 
cagttaatat 
gtgagcatct 
tctg tacccc 
aggcttagee 
gecagtattg 
caagttttca 
gtcaggtgtt 
eggtcagtta 
gtc-actatt 
ctttggctct 
cttttaacaa 
ctactcttat 
atgagttgga 
tatgatgttc 
agagataagg 
aatgttctca 
attggtaaca 
gtatgataaa 
catctctctg 
atgtatatgt 
tttcatagtc 
aaacaatttg 
tgttttttaa 
taatactacc 
gagttttttc 
gtaaaatgaa 
tgatcagtaa 
cgtatccctc 
taccaaccta 
Cttggccatg 
accctgtgag 
agttgtcact 
tattagtaga 
cttcatctct 
tatctgtgga 
-ggccgttca 
a ta batata t 
taaagtggct 
ggacccagaii 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
24O0 
2460 
2520 
2580 
2640 
2683 



<210> 97 

<211> 2181 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (5) 

<223> n equals a, t,g. 



<400> 97 
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gtacnggatt 
agatgattct 
aatttttaaa 
ctttatcatt 
tggctatttt 
attcgtgctt 
cctcttcaca 
caaccggatc 
tgggaaagcg 
gcagagggcc 
cgtgtccctc 
caagatcctc 
catcggcaac 
cacacggcgc 
aatcaggacc 
. ggccaccaag 
! gctggtggac 
attcatcatc 
aggaagcccc 
gagctacctc 
gaaggagcgg 
cttcacctcc 
cggcatgtcg 
gcagccagga 
agtggatggt 
caacattgat 
tgtcacagtg 
gcgcacttac 
gatcacaccg 
gagtgtggtg 
cgtcaccatc 
gcccttccag 
ctgccacgga 
a 9ggcccagc 
cgtcctaaca 
cggggaagag 
gagtacagga 



cccgggtcga 
gggcctcccc 
cctactgtag 
agatatgacg 
gtgcactact 
gacagcag-g 
attctccatg 
aaagtatgga 
tacattcacc 
accaggctcc 
atcgtcttcc 
aacaacaccc 
gacgtggact 
gtgcacgagg 
ccgctcctct 
accctgttcc 
aggaagccgg 
ctgaagacag 
aggcctggag 
accacaaagg 
ctgcggcagc 
a tgaagctga 
gctgccatgg 
cctttgctca 
gatccccact 
gggcagcccg 
aacggagagt 
ttgcgcacta 
agcagagtca 
g^ggggagct 
cagggctcca 
cgacaccacc 
ctgctgggLc 
cagaacctca 
gtgaaaggcc 
cagurtagayt 
tcacctggca 



cccacgcrtc 
catctactgt 
tacaacaagc 
tcaatagaga 
ttgctcctaa 
cttctatggt 
acctccgacc 
aggaccactt 
atatgtcacc 
tcaacaagta 
tgacggatgg 
gagaggccgc 
tcaggctgct 
aggaggacgc 
ctgacatccg 
ccaactactt 
atcapctgca 
atgtgcctgt 
gcgatggaga 
agccgctgag 
gggcccaggc 
gggggccggt 

gacccga a cc 
agaagccata 
ttgttgtgga 

gggacatcct 
taattgggg C 
tcaccatcct 
tcttggatgg 
ggggkctgga 
tagcctttgt 
tgggtttcta 
agttcctgaa 
ctcaccctcc 
accaagtccc 
gytggtttgc 
t 



cggcaggcag 
cattaaccaa 
caggatcgcc 
acagagcatt 
agaccttcct 
gggaaccaaa 
ccaggaccgt 
gatatcagtc 
cactggaggc 
cgtggcccac 
gaagcccacg 
ccgaggccaa 
ggagaaactg 
aggctcgcag 
caccgattat 
caacggctcg 
cgtggaggtc 
gcggcctcag 

gggggacmcc 

ctcctggctg 
cctggctgtg 
cccacgea tg 
ggtggtgcag 
ccagccaaga 
tttccccctg 
caggctggtc 
acccgcccct 
catcaacaag 
tggggacaga 
ggtgtccgtg 
catcctcatc 
cattgccaac 
-caggatgcc 
gctccttcag 
agtggtctgg 
caggaacatg 



aggggcagtg 
aatgaaacat 
cagaatggaa 
ggggacatcc 
cctctaccca 
ctccggcaga 
ttcagtatca 
actccagaca 
acagacatca 
agtggcattg 
gtcggggaga 
gtctgcatcc 
tcgctggaga 
ctcatcgggt 
ccccccagct 
gaga tea tea 
accgccagca 
aaggcaggga 
aaccacatcg 
caaagtgacg 
agctaccgct 
gaeggectgg 
agegtgegag 
attaaaatct 
agcagactca 
tctgatcaca 
ccaaatggcc 
ccagagagat 
ctggtgctcc 
tctgccaacg 
cacctctaca 
agegagggee 
agactcacag 
gtgggagagg 
aagcaaagga 
ccgccaaact 



ggcgogggga 
ttgecaacat 
ttttgggaga 
aggttctaaa 
agaatgtggt 
ecaaggatge 
ttggattttc 
gcatcaggga 
aeggggtect 

gaeaceggag 
cgcacaccct 
tcaccattgg 
actgcggcct 
tctacgatga 
cagtggtgca 
ttgcggggaa 
acagtaagaa 
aagatgtcac 
agegtctctg 
atgaaccgga 
tcctcactcc 
aggaggecca 
gagctggcac 
ctaaaacatc 
ccgtgtgctt 
gggactctgg 
acaagaaaca 
cttatctcga 
cctgcaacca 
ccaatgtcac 
aaaagcegge 
tttccagcaa 
aagaccctgc 
ggectgagge 
agatttacaa 
gattgaeggg 



<210> 98 

<2ll> 1957 

<212> DNA 

<213> Homo sapiens 

<400> 98 
acgcgtccgg 
tggcggctgt 
ggceggggge 
aegeggagtt 
acaactcgag 
gagtggtccg 
aggagtatgt 
tgtacttctt 
cacccccatt 
getegggggt 

agcgctggtt 
tggcctggct 
tc cgggctgg 
accagcgtct 



aggeggcagg 
ggctctaccc 
cgtggcggag 
cgtgcagcag 
gttccgggcc 
gctgagcacc 
ggagcagctg 
eggggacaac 
tggcctgctg 
gcccttccac 
cctttaccca 
ccgggacaca 
tgaggtgctg 
tcatctcacrc 



ctcatggcgc 

ggctccgggg 

gaggageget 

tacgccttcg 

otgtgctccc 

gccaacacct 

ctgcaccccc 

aacttcaccg 

ggaaccgctc 

tggca-ggac 

cctqagaaga 

tacccagcct 

acttcccgac 

ttccccggct 



eggegtcgng 

eggagggega 

gcacggtgga 

tcaggcccgt 

gcgacaggtt: 

actoctacca 

aggaccccac 

agtgggcctc 

cagcttacag 

ccgggtactc 

cgccagagtt 

gcaccgtctg 

cgctggtggc 

agecaaeaca 



gttgctcgcg 
cggcgggtgg 
g-gtegggee 
catcctgcag 
gctggcttcg 
caaagtggac 
ctccctgggc 
tetctttegg 
ctttggap.tc 
agaagtgatc 
ccaccccaac 
cacggccctg 
atgetacget 
gctggcagga 



ctctgygcgc 

cgcccgggcg 

gacctcacct 

ggactcaegg 

tttggggaca 

ttgcccttcc 

aatgacaccc 

cactactccc 

gcaggagctg 

tacggtcgta 

aagaccacgc 

gagtgtacca 

caaccttgac 

ctgccggtca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
. 780 
840 
900 
960 
1020 
1080 
114*0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2181 
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cacaccagca cgtcccacct cgtgctcacg gattttatta cacagatagt ggcggcaatg 900 

gcctcagccc agcccaccct cacctgcttt tccagcccac aaagggggac gatcacggcc 960 

cagcaaaagc gatgctgaga ggggaaacag tccagagtcc aacagcagaa cttgggggaa 1020 

gcggtcgggg tggccaggaa cataaactat gtatagcggc cgggggcttc tgcccagggc 1080 

tcccctggac caggacgcca ggtagggcag ggaaccccag tagtcctcca cccagccatt 1140 

ctcagagatg aatgcgtcaa taacctcctt catagccaag ttggggatga gctgttcctg 1200 

ggtcaggggg ctccgggtca cggggtcaaa atgacccaca cgctgcagtg acaagaaggg 1260 

cagagggcag tcatggggcc caggaccatg ccactggccc tgctccccca gccgcaggcc 1320 

tcacctgcag gtgctcctcg atgtccttgc ggtcgtaggt gatgccactg ggcgtgatgc 1380 

acggctcccg catcagctca aagctgatct tgccacacag gtagtcgggg atgtctcgct 1440 

tctgtggcac aggggcacac ggtcagaggc tgaaaagggc cactgcacga gcacctgcca 1500 

gccatcggca gcaagcgaca cacactcacc ttcctcttct catccacctg agaaaaaagc 1560 

tcgtccatgt ccgccatgta cttgtcctgt gaagagttga gtgctgtgct tgggggagac 1620 

accccacctc cctcctccaL ggggcacaga cccaacacaa ggcggggatg ctcccacgcc 1680 

acgtgcacac acacagaccc acatgtgggt ggggggcacc ctcacgtgct tggcctcaat 1740 

gcaggcctgc tgggcccgga cg^ggctgtc g tec tea tea ccctcgtggt ttcgctggca 1800 

ctcttccagc tccctggggg ttgaccagga geeggtcaga gatggacctg gccagatgtc 1860 

tgaccacacc ccaatctcag agctaacatc cacacttccc cacatttcct gettgecagt 1§20 

aaagccntcg acaaacaaaa aaaaaaaaaa aaaaaaa 1957 



<210> 99 

<211> 1112 

<212> DNA 

<213> Homo sapiens 



<400> 99 

ccacgcgtcc gcccaccaac aaoggcactt gcagttacat tcaccacatt tgtaaeggag 60 

ccattgaagc aiattggaaa aggaactggg gaatttatta aagcactcat gaaggaaatt 120 

ccagcgctgc tccatcttcc agtgctgata attatggcat tagccatccc gagtttctgc 180 

tatggtgctg gaaaatcagt tcatgtgctg agacatatag gcggtcctga gagagaacct 240 

ccccaggcac ttcggccacg ggatagaaga eggcaggagg aaattgatta ^agaccrgat 300 

ggtggagcag gtgatgeega tttccattat aggggecaaa tgggccccac tgagcaaggc 360 

ccttatgcca aaacgtatga gggtagaaga gagattttga gagagagaga tgtcgacttg 420 

agatttcaga ctggcaacaa gagecctgaa gtgctccggg catttgatgt accagacgca 480 

gaggcacgag agcatcccac ggtggtaccc agtcataaat cacctgtttt gga-acaaag 540 

cccaaggaga caggtggaac cctgggggaa ggcacaccga aagaaagcag Laczgaaagc 600 

agecagtegg ccaagcctgt ctctggccaa gacacatcag ggaatacaga aggztcaccc 660 

geageggaaa aggcccagct caagtctgaa gccgcaggca gcccagacca aggcagcaca 720 

tacagccccg caagaggtgt ggctggacca cgtggacagg ateeggtcag cagcccctgt 780 

ggctagagga acaccagcac aaacgacagc ctcaagtctc cttcgagctt tgtaaagtaa 840 

tctcacaagc tgccaaagaa cacatcccaa gcagtaatca cagatgatga agcaccttac 900 

aggacccctc caccctcaaa cgcgcatgtc cagagaagta ttaatgeett aatagactac 960 

tgaaggctaa ccatttacat cattctaaaa atattttaag ctgtattaca gtgctataaa 1020 

ttctcctttL aaggaaaaaa gtaatagatt gtttccttac cgtgtttgaa aggaaaaaaa 1080 

ataaatccac agtaactaaa aaaaaaaaaa aa 1112 



<210> 100 
<211> 887 
<212> DNA 

<213> Kono sapiens 
<220> 

<221> SITE 
<222> (303) 

<223> n equals a,t,g, or c 



<400> 100 
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=ipp.IiillS;s 

Eiil IIP 1=111 

i--^*-,,*-.-:; ttt ^acccca taaattattc tactgtgaaa acacaccatg tttaaccaor 

~ ES ™ ~ ■"»"'-' =s= 

«!f «cS;.1S SstSt S££S SS; 
255S 55= 255= 232= ~sH =22= 



<210> 101 
<211> 1248 
<212> DKA 

< 2 13 > Homo sapiens 
<400> 101 



2H5 ~ ESS = ~ xssz 

= =55 5=3 SE SIS ESS 

s=s ~ S=2 Erin « LES 

ESS ~ = =ss 55== Lis 

=== ~ ~ ~ ==== =55= 

™cta Stacag^: tltlTcllt L C gtc C c a a^ C c tacT"^ 

SSS ^T caag cccttcttcc caca ^ SSccS: 

t^atattcct Cattttt ^a cattatgtta ttggaatttt ctttcatata 

aaaaaaaaaa a^ataaaaaa aaaa — aaaaaaaaaa aaaaaaaaaa 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 

<210> 102 
<211> 1B41 
<212> DMA 

<213> Homo sapiens 
<400> 102 

ggcacgagct ggtgcaggag ctggagcagt accagttgtt gccgaaaaaa tt™^i- rt 
SSS »«*««gg agcracgagg agttggtctr. catgtggct? 
ccagtatttc ^ CtgCaaatC tgccagcgca tcggtcctat gttggataal gaaattcclc 
ca^aggtac "EaSa tCtt£ ! et ! fl <~W-ag gcagtctttg ctacgt^g 
t*Jtln+lT cttaatttga agaagtgttc tgctttgtat taatagccca ataattatta 
£££££ L"T: 9 ^^tggaag aaaggtca.a atgtalctat aaatcatgct 
gtggactata aaatLcuctg catr.tagcat agaacattga attttattac ataagtgttt 
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aatgggtcc- tctgcattaa aaaagttgat aaagatatag atgttaagct aaaaagcttg 480 

aatttattct gagatgttgt ttgaggtttt atcaatgtga gttcttaatt ttccattttg 540 

tagctcactc ataacttcat tgaatgtaat tgtatgtgtc taataaactc tttaaatgct 6O0 

agaggtattt catttgatat ttagcttctg aaagccettt gttacaaagg gaagtgattt 660 

tctaaatttg gtagcagctt gaaaacaagc accacaaatg taaaattgca rggttctttt 720 

tttactaatt taattttttt gccaagtcat gtgctagcaa aatgtttgaa attgctattc 780 

tttttctgtt agcttggtat atatgactta gactaagtat atgtatgtga tataatatat 840 

gtgtacatgt ggctaaactg ctaatgggca aagattgttg gttcttctga tgctgagtat 900 

acatcaagtt catcccatga ttattttcta ttacgagcta ttgctgatgt gatgctactc 960 

ttagctgcac caccagtata aagcttaacc gagaaagaac acccctccct cttagttccg 1020 

tgccagactg gtttgttagc tttagttgtt tcttagtgct ctagccatgt caccctgtgg 1080 

cactaatcag gacactgttt atattcttcc tcttctgtgt ccctattcac tttgtaatta 1140 

ttctatttta gctggttata tggtagttaa ttggtagcct gttttccttt gttttcttta 1200 

cttgtgtcaa gctccatgcc gtLgLLLLgL gatgagcagt gatttagaac tggccagacc 1260 

ttagaagagt cttcagcaag ctgccctagg gtaggggcat attctctgtt tgrtttgtac 1320 

tggtacccca ggaggatgcc tt#rtaactgc tctttggtca tgatagtggt aatgaatcca 1380 

ttcatgtaaa atgcaatatg tctgttagtt tggggcaccc gaggctcagt agaggtttag 1440 

tatgatgcct tccctttctt ctttgagtra tgaagaaact aatttctggc acagtgtgac 1500 

tgctccaaac ctatgttctg ctgc-atctc ctatagagga gggcagactt ttttttctta 1^560 

gtgcatttgc tggtaggtca aggagggatg gactttgaat tagtgrtcag agtcacagat 1620 

cctacgtatg tgctcttcag tagaggattt tctgtgatcc tacaatgaag ggaaagctat 16 BO 

ataatttatt ggtacttcct aaagatagag actaaagtca tggtagtatt ggccaaatta 1740 

attttaaaac tgaacaagtc atgaattttc atataaatca aagaactttt tccttataaa 1800 

agcttccatt tttgaaaagc tigaaaaaaaa aaaaaaaaaa a 1841 



<210> 103 
<211> 685 
<212> DMA 

<213> Homo sapiens 

<220> 
<22i> SITE 
<222> (678) 

<223> n equals a, eg, or c 

<220> 
<221> SITE 
<222> (679) 

<223> n equals a,t,g, or c 



<400> 103 

actacggctg cgagaagacg acagaagggg ggcggcgacg gaggaggagg arggaggcgg 60 

tggtgttcgt cttctctctc ctcgattgtt gcgcgctcat cttcctctcg gtctacttca 120 

taattacatt gtctgattta gaatgtgatt acattaatgc tagatcatgt tgctcaaaat 180 

taaacaagtg gctaattcca gaattgattg gccataccat tgtcactgta ttactgctca 240 

tgtrattgca ctggttcatc ttccttctca acttacctgt cgccacttgg aat-.atat.atc 3 00 

gatacattat ggtgccgagt ggtaacatgg gagtgttcga tccaacagaa atacacaatc 360 

gagggcagct gaagtcacac atgaaagaag ccatgatcaa gcttggtttc cacttgctct 420 

gcttcttcat gcatctttat agtatgatct tagctttgat aaatgaccga agctggagaa 480 

gccgtggttg aagtcagcct. acaccacagt gcacagt^ga ggagccagag acttcttaaa 540 

tcatccttag aaccgtgacc atagcagtat atattttcct cttggaacaa aaaactattt 600 

ttgctgtatt ztcaccatat aaagtattta aaaaacarga aaaaaaaaaa aaaaaaaaaa 660 

aaaaaaaaaa aaaaaaanna aaaaa 685 



<210> 104 
<211> 1168 
<212> DMA 
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<213> Homo sapiens 
<400> 104 

ggcacgagcc cccgccccct ccctccctcc cctccct.tr.c ccggccccgg ctctggcccc 60 

ggcccattcg ctgttgggtc ttctgctagg gagga^gtcg ggztcgtcgc tgcccagcgc 120 

cctggccctc tcgctgttgc tggtctctgg ctcccccctc ccagggccag gcgccgctca 180 

gaacgtgaga gtacaatctg gacaggatca gaagtagaga atgaagttgt aagagaaagg 240 

gaaagacaga agaaaggctg cagtagtaca aggagaaaag caggazgcaa gaatgaggaa 300 

tgaatctf.tg tr.tgaggagc atccggaaaa atataagctg tcagaaagag taaatagacc 360 

agggacctct aaagtaaatt cacacatcaa agttaaaata atgttggaga aLcacctcct 420 

gtgaaaaatL ygcttagctt tcagtatgcc tctttaaaca aaacattatc attttataca 480 

aatttttaaa atgttgttca taatatggag tttaatttta agcaccttag ataatgcatt 540 

tgttagcttg tttgtgaaat ttttatggat tttttttttc aaaattccca attttagttg 600 

gtaaggacta acztcgggaa gacaggaaac ccctccagta aaattaattg gttataaatg 660 

gttacatatt ttaggcttat atacataaca aadj t Lc taa ctgaatttaa aagtgatctt 720 

gtgcaaaaac attatrctag tgatattgat gtctaaattt aaaagagtga atacacaagt 78C 

aaaatatatc tgcttzttaa ggatctattc agtttaggaa ggagagtcaa aatttgtatt 340 

taatttcaat ttattatatt gEctgaaact gaaagtaggc ctaacttttg tttgcttttg 900 

tgtatgtaca aaggcaaaca tttatcaatc aactcttatt aatttgagat tatttLyacc .960 

tgattccLca gaacttttgt ctatctgtat agaaatgggg ttttctaaat atttaaagaa 1020 

tttcctacgt atgtaaattt acttgcacac agtaagttag gggagacatt taatattcct 1080 

caqatacctg ctgttttctt gttgtgtttc t-gtttttca aataaataaa ctgagtgtta 1140 

tctgttcatt aaaatagaat gtagtcag 1168 



<210> 105 
<211> 1175 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (24) 

<223> r. equals a,t,g, or c 



<400> 105 

gcatggaaat atatcctata catnaagtgc tgtcctgagc tgtacatwta gccatattta 60 

grattttatt argcaggatt caaggcctcc taaaaatgga tattcagata tttaaaaaca 120 

cactattaag gaatgcaatg taaaatcaca ctaacatttg catagcactt tatagtttac 1B0 

aaaatgcctt cacatatacg agctcattta tttctccttc tcttctttct attcatctat 240 

caaggaatat cttctataag ccaggcatca ggcttgacac tgaagacaca aaatgaaaaa 300 

gatatacaag tctctattct taaagagttc gtagtttaat gaagtgaccc aaaaagtaaa 360 

cagttgcaat ataacatgca aaggagtgcg atgaaataca cctagagtac tatgaaaacg 420 

gcatagaggg accaagcctg taatgaagtc agagaaagta aaagaggtga agcckgagct 480 

aagttttaaa ggatgaacaa aaattgacca gacaaaggag ttggagaagt gggacggtac 540 

agga-aaaag cctttctaat ttggggaagt gtgagtcgtc gatattagat gatattatta 600 

atagattgtg cacagagtgc acatggtaga gaggtgagag atgaggtagc actgttaggg 660 

cagataggLy aattctcaca gtaaccctgt gwtgtacmtt atcatcctcc tctccattan . 720 

gtaggggaga acaccaaagc ccaaagagat taagtaactt tccagaatt- tgatttcaaa 780 

tcctaagcgt ttggtagctg tgtttccctg catar.agt.ct gcctcaaaag aaaatttaaa 840 

gttgtatttt gtattaaatt taaacaaata raggccgggt gcagtggctc acgcctgtag 900 

tcccagcact ttggtaggcc aaggtgggcg gatcacctga ggtcaggatt tcgagaccao 960 

cctggccaat atggtgaaac cccatctcta ctaacataca aaaattagct gggcatggtc 1020 

gcacatgcct gtaatcctgg ctacttggga agctgaggca ggagaatctc ttgaacccag 1080 

gaggcggagg ttgcactgag ccgagattgt gccactgcac tacaacctgg acaacagagc 1140 

aagattccgt ctcaaaaaaa aaaoaaaaaa ctcga 1175 



<210> 106 
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<211> 1021 
<212> DNA 

<213> Homo sapiens 
<40Q> 106 

ggcacgagaa taaaatagat ttagggtatt cgacacattt aatacotaat gaatagzgoa 60 

tttcttgaat aatataacgc ctaaatgtga ctgatatgca aaatatttaa atcccttaca 120 

ggatgtgcat ttaccaatca gaacacatgt tggctttact gcttgtatca gttttctgta 180 

ttagtttgct tgtattagtt tgctggggca gccataacaa agtaccacag aaatttatat 240 

tctcacaatt ctgggggcta gaagactgag ataaagatgt ctgcagggtt ggtttcttct 300 

aaggactctc ttgttggctt gtagatggct gttgtctccc tgtgccttca cctgcacttc 360 

cctctgtacc tgcccgtatc ttaacctctt cttataagga cagagtcata ttggattagg 420 

gccccaccct aatgacctcc ttttaactta attacctctt tagagactct gtcttcaaat 480 

acaggcacat tctgaggtac tttaaggtta gaacttcaac atatgcattt ggaactgggg 540 

cataactgag tccatgacac tattggagag ggttctggag ccataacaga tggaatgaat 600 

gggccttgtt tggattctaa tttaaataaa ccaactatag aaaaacatgt tttaggcaat 660 

cagatttttt gttatgggct aaitatttag aagatgtcaa gaaatgttta ttaatttcat 720 

taggtaaaat aatgttggtg tggttatgta gagtatgact tacgtgcaca gaacagaatg 780 

ataaggtgtc caggatttgc tttaaaatat tctagat- tt ttcaaaaggt gagaggeaat 840 

gacacatgta tatctaaatt ttgatggttg ttgaagccgg ctggtaggta cctagtattt 900 

catta-actt tttcctttgt gtgtttggaa atttttataa taagcaggaa aaataaaagt 960 

oagaatatat gtattataga taatttgaaa tatgtagaat catgaaaaaa aaaaaaaaaa 1020 

a 1021 



<210> 107 
<211> 830 
<212> DNA 

<213> Homo sapiens 



<400> 107 

ggcacgagta aggactgtgt tctttatgca tttcccgacc caggcatggc agttcctctt 60 

ttcctgtaca tartcacact cctgccactt ctaccctttc tcttatccct ctgcttttca 120 

cctctgactg taaaaagaag tagcagttcc gaaagcaaga gttccctatg aacacggaag 180 

aagacattgg caacttttga gtacaacaac tatatttaat agagtaattc aagaacatca 240 

gccagtgaat tttatacaag atagtgaaag agaaaaggaa gattaattag gggtagttta 300 

ggatgccatt aaatagccta gaattagggg agtagccgtt gaatagaaag gaggccacaa 360 

attcgaggga tataagctaa gaattggtaa gccaagaaga aggaaaaggt ttgggcagta 420 

aggataazga ggaacaaaat agagaactca gaagcaatat ctgactgtta tcattggaag 480 

aatttttrtg cttgcttgag gctggatatt gaagtggatc aggatacttg agtgactatc 540 

tgatgggctt ttggaactag ctctcaagag gtgaaaatta gctttttttt ctttitcctt 600 

cttttttttt ntttttgagg caaggtccca ctgttgttga ggctgaaccc cctgggctca 660 

agcagttgtc ccattgcagc ctcctcagat actctgtaag ccaaggcagg gggaatattt 720 

tgtgctcagt. agtctgaggc tgtcgtgagc raagatcaca ctgctgtgct cacctcagcc 780 

tgggcaacac agtgaaaccc cgtctccatc tgtttaaaaa aaaaaaaaaa 930 



<210> 108 

<211> 1301 

<212> DNA 

<213> Homo sapiens 



<400> 108 

aagagggtgg crgcrgagtt rttgtttgtc ctgtgctctg agagtggtga gttatggaga 60 

cccaggtccc agcccacccc acctcagcca ggcttgcaca ctgaccictg ccttgcccct 120 

cagtgccccg attuatcaag cacacaggct atgggaatgc tgccggcctt ctggctgcca 180 

ggggcctcat ggcaggaggc cggcccgagg gccagtac-c agaggatgag gacacagaca 240 

cagatgagca caaggaagcc aaagccagca taaaccctgt gaccgggagg gcggaggaga 300 

agccgcciaa ccctatggag ggcatgacag aggagcagaa ggagcacgag gccatgaagc 350 
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acagagtcat 


ccagccaatg 


gggetgagtc 


420 


ccatgtgcga 


gactatggag 


cagcagctct 


480 


gatggcagct 


cttctgctcc cccatcagga 


540 


ttgcaacctg 


gggaagccac 


atcccactgg 


600 


gaaacccctt 


ctcttgactc 


ccgttctgtt 


660 


tctggactgc 


aacggtcttc 


ttgtgctaga 


72 0 


aagtactttc 


ttttgtctgc 


gccaagagga 


780 


ggcctacctg 


ggcacacaga 


agagcacatg 


840 


caaagcaagc 


accatctggg 


attggcacac 


900 


gcacttccca 


gggagaaaac 


ctgtcagaac 


960 


tccaaggtat 


gtatgctctg 


ttgttcctgt 


1020 


ctcttagaca 


ttctccttgg 


tccncgrtca 


1080 


gttctcgctg 


gttttggcaa 


ttaaacctcc 


1140 


aggaaaatgc 


ccctcgtgtg 


accatagatt 


1200 


cagaaacatc 


atcttgaaat 


aaagaagagt 


1260 


aaaaaactcg 


a 




1301 



<210> 109 

<211> 1932 

<212> DNA 

<213> Homo sapiens 

<400> 109 

aatttttttt tttttttttt tttttttgaa aaaaaaaatg ggtagtgtat attttgcagg 60 

tttaagacaa ctcaggacaa taaaaacaat ggactttaca tgtgtatata tatagctctc 120 

ttaggcacca taatcagtat gagccaacaa tatttaaact tgattcaggc cacattcaga 180 

catttgctct tatatacaaa tatttaaatt aaatacaatc tgaaatgtgt tctgttacat 240 

acaaaaaagg aaaaactata caacgcagag cagtgtgtgt gttttaaata attacattta 3O0 

cacgtaagct aaatggaacc agcaatggtg ctcaagtttt tatcatccct tccagaaaat 360 

ctttttctac catctcttct actttttgcc tggctttgct ggaacatggt: ttgtggtLct 420 

ccagtttcat gtccttatta gggaaggcat ttgagtagag gataggactc cctgagtgtc 480 

ctccacatcg gcttgtgact ttgctgttga agacttgact gagcacattg aagaacggca 540 

ggagctgctc catactgcgc acggtgcaga tggtgagcag caagtgccct ggctcccaac 600 

ccaatgttct ccctgagttg tcztcctctg gatttttctg cagaaaacaa aaagtgaact €60 

ggtattaata caacagacaa tgtggtatgt tagaaaaatt aaaaatatat aaactttggc 720 

aattggtcaa gaaazgaata caaatgacat taagtttcta actcctgacc tgatcaaaac 780 

ccttggtgct tctgagacct tttactgcca tttattagtt ttacatggag cagtctaaca 840 

ttgtagcaat agttcccaac tagaatgcgc agataagctt agttaacaga aatagctttg 900 

aacaggaata gagtcaaaca taaaagtttt atgttgtgct ttgtatttac tcaaaaagct 960 

cccaggtttc tgaaccctca ctactgtaac caaggactag gtcacaaaat tactacagaa 1020 

aaaaggaaca aagtgcttta tacacttcat aatatatccc cttttattat aattagttaa 1080 

ttcccttcta tctaaatggc ctaaatttgc catgatggta gcagtgtcca aagtgaataa H40 

ttactgtcag tactgcatca cagagaaagg aagggatccc tcaggagaca ctgctgtctc 1200 

Cttctgggtt gtgctaaaca acatagggag gaaagctgga cctggagtca aaggaartga 1260 

gttagtgtgc tggctctgcc atacttocgg cacccttggg caggatatae aaaggtccct 1320 
cacttataaa atgggacagt ctaaaactac cttttagtag agaagtcaaa tgagaaggta 
tgtgaaaact ctgtcaacta aatataaaga ctaataattt gggtattaag aqgctagttt 
gagaagccac ctgaattaca caaacacagc tacagacatc attctgtcta gagaaagata 

agagagaaca ggttggttga acttgggcag aatcacagat acaattccac actaaagaat x JDU 

gaaaataagc aatgaactag acagaaggaa gaaatcatga agacttagga agcagaatta 1620 
r^^^^ Cattaacaaa tggagtttgc cttctaagat cagatgttgc tcagaaactt 
tcattgtt.a cctaataatt taatatcact agtttcctag tgggtcaagc agatgcaaaa 

tccagcttat cttcttetat gcgc-ctcaa gcttattgct tattttaaag taaaatcctg iWUU 

aaaaaggaaa atattaggtt ggtgcaaacg taattgcggt ttttgcattg ttgaaatttg 1860 

ccgttttata ttggagiaca ttcttaaata aatgtggrta tgttatacaa aaaaaaaaaa 1920 
aaaaaactcg aa 

y - 1932 



1380 
1440 
1500 
1560 



16B0 
1740 
1800 
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<210> 110 
<211> 1534 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1212) 

<223> n equals a.t.g. or c 
<400> 110 

tcgaagatag gttcgagocg gtggatgttg cagctgotca tgcagttggg ttcgg-gctg 60 

ctcacacgct gccccttttg gggctgcttc agccagctca tgctgtacgc tgagagggct 120 

gaggcacgcc ggaagcccga catcccagtg ccttacctgt atttcgacat gggggcagcc 180 

gtgctgtgcg ctagtttcat gtcctttggc gtgaagcggc gctggttcgc gctgggggcc 240 

gcactccaat tggccattag cacctacgcc gcctacatcg ggggctacct ccactacggg 300 

gactggctga aggtccgtat gtactcgcgc acagctgcca tcatcggcgg ctttcttgtg 360 

ttggccagcg gcgctgggga gctgtaccgc cggaaacctc gcagccgctc cctgcagtcc 420 

accggccagg tgttcctggg tatctacctc atctgtgtgg cctactcact gcagcacagc -*480 

aaggaggacc ggctggcgta tctgaaccat ctcccaggag gggarjctgat gatccagctg '540 

ttcttcgtgc tgtatggcat cctggcccct ggcctttctg tcaggctact acgcgaccct 600 

cgctgcccag atcctggctg tactgctgcc ccctgtcatg ctgctcactg atggcaatgt 660 

tgcttactgg cacaacacgc ggcgtgttga gttctggaac cagatgaagc tccttggaga 720 

gagtgtgggc atctteggaa. ctgctgtcat cctggccact gatggctgag ttttatggca 780 

agaggctgag atgggcacag ggagccactg agggtcaccc tgccttcctc cttgctggcc 840 

cagctgctgt ttatttatgc tttttggtct gtttgtttga tcttttgctt ttttaaaatt 900 

gttttttgca gttaagaggc agctcatttg tccaaatttc tgggcttcay cgcLtgggag 960 

ggcaggaacc ctggcaccaa tgctgtacaa ggtttttttc ctgttaggaa gaacttgagg 1020 

ccagctgccc actgagtctt ctgtccctga agaaagggag tattgggcag ggcttgggat 1080 

ccggctactg agagtgggag agtgggagac agaggaagga agatggagat tggaagtgag 1140 

caaatgtgaa aaattcctct ttgaacctgg cagatgcagc taaactctgc agtagtgttt 1200 

ggagactgtg anagggagtg tgtgtgttga cacatgtgga ccaggcccag gaagggcaca 1260 

ggggccgagc actacagaag tcacatgggt tctcagggta tgccaggggc agaaacagta 1320 

ccggctctct gtcactcacc ttgagagtag agcagaccc- gttctgctct gggctgtgaa 1380 

gsrggtggagc aggcagtggc cagctttgcc cttcctgctg tctctgtt.tr tagctccatg 1440 
gttggcctgg tgggggtgga gttccctccc aaacaccaga ccacacagtc ctccaaaaat 



aaacatttta tatagacaaa aaaaaaaaaa 



<210> 111 
<211> 2871 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1234) 

<223> xi equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1259) 

<22.3> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1283) 

<223> n equals a.t,g, or c 



1500 

aaaa 1534 
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<220> 

<221> SITE 
<222> {1284) 

<223> n equals a,z,g. or c 
<220> 

<221> SITE 
<222> (1297! 

<223> r. equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1378) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE * * 

<222> (1912) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1913) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1935) 

<223> n equals a,L,g, or c 
<220> 

<221> SITE 
<222> 11947) 

<223> n equals a,t,g, or c 
<400> 111 

gcggcgaggc ggcggtggtg gctgagtccg tggtggcaga ggcgaaggcg acagctctag 
gggttggcac cggccccgag aggaggatgc gggtccggat agggctgacg ctgctgctgt 
gtgcggtgct gctgagcttg gcctcggcgt cctcggatga agaaggcagc caggatgaat 
ccttaggatt ccaagactac tttgacatca gatgagtcag taaaggacca tactactgca 
ggcagagtag -tgctggtca aatatttctt gattcagaag aatctgaatt agaatcctcc 
llltt^T 9 a9gaa ^ aca ^ cctcaagagc caagaggggg aaagtgtxac agaagatatc 
tfltt ? 9 Wctccaaa tccagaaaac aaggactatg aagagccaaa gaaagtacgq 
aaaccaggta gtctggacat tttccttgct ttctgattta tttaggggac aactgaaaat 
cccaagctaa tgaataaaga agctcaagaa gactggcttc actgactatt acccacaaat 
aatatatgga gtgtagtttg gagagaattt czagatttta ntataccafla gttattcact 
caacagattt gacccaagtt acataagctg aattccatag atgagattgc atttatcttt 
ttattaatat cctgacgttt cttcttggaa tcaagtatgt aaataccttt ctcactgaga 
aacctaaaaa aataatatgt cagggcagca tacaaaacag aaactcagga agaactgtct 
aacatgccac tatggtcatt -tcaggattg tggtaggatt attactacaa taaccttgtc 
gaccatccta ggtaatutga cttaggtgtt taicretcca gcttgacgct tgtgtgcact 
tccatcctat tctaacct-c acaatcttga aacctctact tctcaattra aaacttgacc 
lilt ta 9tcctc-a aatatcaatc cctttaaaaa ttattaatgt atgcattcta 
ccatgtcttg ttgcccttct gtcatetatt artccaatga tcactgcctt atctgtttat 
! a ;^^ ta ctaaccactc tcaggaactg acccaaattg gaacttttta aaaaaataga 
attttutttt actacaaata taccattctc acgaacacaa aactactttt aaaaaattaa 
™^^^! a atcccatlcaa ctagaaataa cagntgctca catcatggag aatattctnt 
t^.t tt ttag ^tgcat cannagnttt aagaaaagta aattgggtca aaatgtgtat 
tctatgttgc agcctgcatt tr.gcactc.aa cagtttacca tgaatgtttt tcca-gtnat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
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ca tttgcaca ggggaagggg gtgccctcct tcctagaggc cctgggggcc aggctgactt 720 

ggggggcaga cttgacacw ggccccactc acccagatgt cctgaaattc caccacgggg 780 

gtcaccctgg ggggttaggg acctattttt aacactaggg ggctggccca ctaggagggc 840 

tggccctaag atacagaccc ccccaactcc ccaaagcggg ggggggatat ttattttggg 900 

gagagtttgg aggggagggg gatttttttt ttaaangatt ttttantttt naaaaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaacaa in „ 



<210> 113 
<211> 2214 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (289) r 
<223> n equals a,t,g, or c 

<400> 113 

cggacgcgtg ggtcgaccca cgcgtccggg aaaaarggaa aaratgccgt gtaaaacctc 60 

gttctgtgtc tgaattgccg taggctcaga tcttcatccg aggctctgtg cctgaattgc 120 

cgtaggctca gatcttcatt tgaggttatg ttctataagt taacgttgat cttgtgtgag 180 

ctttcggtag ctggagtaac acaggcggcc tcacagcgac ctctccagcg ccttccaagg 240 

cacatctgca gccagcgtara tcctcctggg agatgcctcc tcaaggccnt gctccagacc 300 

acgtgggrar ggcctgacaa gccaattccc aggctgtccc cacccttgra gagtgaccct 360 

!^ ga c * gat 99<?ga atgggaaaga aaagaaagct gcagacctca agttaaaatt 420 

540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



ccctccaaaa cgcttttatt tatctgcttt ttctgaaagg ataaaggctt tttgaaaatt 
attttctaac aaataacatg aacacttcta gaaaccctag aaaaacacaa agtattcaaa 
atagaaagaa aaattaecca ttactcttta agccagcatt atccattgcg gtgcttttgg 
agttgggtga ggccgtagcc tctgccaagt caaggagccc ggtggtggct gtggcattcc 
tgcagggttg tttttttttc tttgagatgg agtctcactc ttgtcacccc agctggaatg 
tggtggtgta aacagctcac tgcagccttg accctgaggc tcaagcgatc cttctgcctt 
ggccr.cctga gtagctggga tcccaggcga gagtcaccac accctgtcca tgttcctgca 
ggtcttgata tgcgaggacg ntgtgtcttc cctgccacat tttcttcttc tttcttgaga 
cagacccttg ctccatcacc caggccagac tgtggtsgtg cgaacacggc tcacrgcagc 
ctcgaccctc aggctcaagc gatcctcacg cctcggaccc ccaaagtgct gggaccacag 

gcgagagtca ccacgctggc ctgaatcttc agggtatttr cggttgargt gycacttact iVOU 

tarccatscc tgtttcaaga gtgtaggtgg ccaccctgtc tctgccgctg acctggcctg 1140 

gaccctcggc tgtgagaggg aggggtgggc tgggctggag gaacctraag ccctcgtgat 1200 

grcacaagcc catctggctg ggcatcccct gctgtgtcct gagctgcaca tgccccaggt 1260 

ggcccccaca gcagaggcga gccactgrag ggcgragggc ttccacggac ggtcttcagg 1320 

ggragaagaa gggcccaggc ccccaggaga ctcaggagac cagagcctgg ggtcaggggc 13 80 

tmagcagggg ctyarccagg gccggatgtc cggagccagc cccgiuagccc tgkgktcttt 1440 

gttcttcgca ctcccaccgt ccgtgtgaac agctccagcc ccacctgcgc ctccctgtgc 1500 

tgggctccat cagggagccc agaagacgtg tgtgcttctg oaattgggtc cctacatgcc 1560 

tttgtcccag tgcaccttgc tccccccact zactatcgag atttaaatgc ctgttttctc 1620 

cccagaggtt gacggatata Ltcagacgtt acgacacgga tcaggacggc tggattcagg 1680 

cgtcgtacga acagcacctg tccatggtct tcagtatcgt atgaccctgg cctctcgtga 1740 

agagcagcac aacatggaaa gagccaaaat gtcacagtcc ctatctgtga gggaatggag 1800 

cacaggtgca gttagatgct gttcttcctt tagatitttgt cacgtgggga cccagctgta *860 

catatgugga taagccgatc aatggttttg caactgtaat agtagctgta tcgttctaat 1920 

gcagacactg gatttggtga ctgtctcatt gcgccatgaq gtaaatgtaa tgtttcaggc 1980 

attctgcttg caaaaaaatc tatcatgtgc ttctctagat gtctctggtt ctatagtgca 2040 

aatgctttta ttagccaata ggaattttaa aataacacgg aacttacaca aaaggctttt 2100 

catgtgcctt acttttttaa aaaggagctt ettgcat-ca ttggaatatg cgacgcaagc 216U 

aataaaggga atgtcagacg tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa aaag 2214 
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<210> 114 

<211> 3300 

<212> DNA 

<213> Homo sapiens 



<400> 114 
tcgacccacg 
cctgcttcag 
cctgctctca 
agcttcctgt 
tgaagatggc 
tgaagatatt 
gaataaagtt 
ggctggtatt 
acagtcaaac 
acgtataact 
tagggtagtt 
ggggacactt 
gagaccaggg 
gtgcctagtg 
agctggcatt 
gcaccttaat' 
tatttcrtac 
tgagaagaaa 
tggagttgag 
ttgatatagt 
ctggaaataa 
gcactgcagt 
tagtaaacga 
cggtttgaag 
gcctggtgac 
ctggtccacg 
cttccatctg 
gaccgtgatc 
caccctcctc 
cctctgcctc 
ggtgctgtgc 
gatgctcctg 
caccagcacg 
tgtgattatt 
ttggctcaag 
cttgatzaaac 
cctcaataaa 
tcagctattt 
gaagacgatt 
gttgctcttt 
gtttagtggg 
ccaaaccaaa 
atcatcatct 
ttggctaaag 
tttccgtttg 
tagctttcct 
agaacaagac 
aagtgtcgga 
gctactgcca 
ttctggcctc 
cataattcaa 
aaaacgcatt 
gtgggactgt 



cgtccggtga 
agagattgtt 
ggtctcactg 
cctccatgcc 
tttcgggcca 
aatgaatgtg 
ggaggttaca 
atagattatg 
gtggatattc 
atgccatttt 
ctatccaggg 
ttggttgtca 
atacagctca 
ccatatgicc 
caaagaccta 
aaatatcgtt 
atttcagtaa 
tggaagcaga 
catctggaat 
ctatgaaacc 
caccatggat 
tgccctatca 
aaagggatgc 
gaaaaaattt 
aagagaacaa 
gagggctgct 
tccagctttg 
acccaggtgg 
ctgtgccggc 
ttcctggccc 
tccatcattg 
gaagggctgc 
ggcagattca 
gctgtgtcag 
cttgataaag 
ctggtgttct 
gaagtttcca 
atcctgggct 
ggatcaatca 

gtggtacact 
atgcggaaag. 
acggaagaag 
gcagagtcaa 
atgaact.gac 
tctacagcgc 
gtg^aaagtc 
atttgggaga 
aggatgcatt 
ccttgccaga 
tctgctgtat 
caattttcat 
atcaggtgct 
aaactagttc 



gaggcaagga 
cccggcgtct 
tcttactggc 
ctaaatatgc 
ggtctggcag 
aaaccgggct 
tctgtagctg 
atcatccgga 
gggtgagtgg 
cctacccaaa 
gtgattttgt 
taccttgggg 
acatcctaca 
a ta gaga tag 
catcaagcac 
gccttctggg 
gagtcaatct 
gaggactttg 
gcgagttttg 
aagaggtgca 
atcaactgtg 
cttatcaatc 
aggaagtaaa 
ccctctctga 
aacatatctg 
ctcatgtgca 
ccgtcctcgt 
ggctgaccat 
ccatccagaa 
acctcctgtt 
cagggctgct 
acctcttcct 
agaagaggtt 
caatagttgg 
gattcatctg 
acttccaagt 
ccattcagga 
gttcttgggg 
ttgcatactc 
gtctccttaa 
gggtagaaac 
tggggaagtc 
ccaagcaacc 
ctggcaagtg 
ccctgtggtc 
taggctcatt 
attcttagat 
ttaaagatgg 
aactcactaa 
tttatgctcc 
atggatcagt 
ggagaggatg 
aaccatcgtg 



cttttcatct 
cagtgctatg 
tctgccagga 
cagctgccac 
gacatacttt 
ggcaaagtgc 
tttggtaaaa 
ttgttatgag 
ggtgaagcct 
cattaacatg 
cctctgtccc 
gtgatgtgtg 
gtgcccagga 
agaaatacaa 
ctagattctc 
ctccccactc 
aattagctta 
caagaagggc 
cttctccagg 
atgagacaag 
ctgatgcttt 
tcttggggat 
actgaactct 
acctgtgttc 
tgtctactgg 
cagcaacggt 
ggctcttgcc 
ctctctgctg 
caccagcacc 
cctgacgggc 
gcacttcctc 
caccgtcagg 
catgtaccct 
accccagaat 
gage t t cat g 
tctgtggatt 
caccagagtc 
ccttggtttt 
attcaccatc 
tegecaggtt 
tgaaagcact 
czcagaaatc 
gcagccacag 
ccatggcaat 
acacatagat 
cacctatttt 
ccagagtcca 
egggegggag 
ctggcatctg 
caaagatctt 
attaaagagg 
tggagaaata 
gaagrcagtg 



ttagggcatt 
gggagcaggt 
teagaageca 
aacagcaccc 
catgattcct 
aagtataaag 
tatactttat 
aacaatagtc 
ggatttggga 
tcttcctgtg 
aaggtcatct 
cgactggcat 
cagcctccca 
gtgtaggggg 
aatgccacac 
caacacctgt 
atttttttgg 
tactcaacta 
aaagggtcaa 
ggagaatget 
aaaaggaaac 
attctgaatg 
tacgttgtga 
ctgacttttc 
gagggatcag 
tcttacacca 
cccaaggagg 
tgcctcttcc 
tccctccatc 
atcaacagaa 
tacctggctt 
aacctcaagg 
gtaggctacg 
tatggaacat 
gggecagtag 
ttgagaagca 
atgacattta 
tttatggttg 
atcaacaccc 
cgaatggaot 
gagatgtctc 
tttcataaag 
gttcatctcg 
gaeceggaag 
tggacaaatg 
ggctttctat 
gtagtgtggc 
aagtggattt 
gattcagctc 
acattaacac 
gtgttgcatt 
ggaacacttt 
tggegattec 



ctgaccacgt 
tcctcctggt 
agaattctgg 
acrtgtacttg 
ctgagaagtg 
catattgtag 
tcaactttct 
aagggacgac 
aacagctggt 
atttttaggg 
gtcaatgact 
ctggtggatg 
caatcaagaa 
aaagtgcctc 
gcaccttgta 
gcatattccc 
aaggaaaatc 
attcaaagcg 
atttctgaat 
tttctggaag 
ctaagagaga 
catccttttt 
gcggcaccat 
gecataatea 
agggaggecg 
aatgcaagtg 
accctgtgct 
tggccatcct 
tagagctctc 
ctgagcctga 
gcttcacctg 
tggecaacta 
ggatcccagc 
ttactcactg 
cagtcattat 
aactttcctc 
aagecattte 
aagaagtagg 
ttcagggagt 
ataaaaagtg 
gctctactac 
gaggcactgc 
tctctgctgc 
ttacccctcc 
ccactatttc 
gttcatagaa 
acgtgcaatg 
tettcttgea 
atagttccct 
tccacattca 
ttgeaataca 
tacactgttg 
tcagggatct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
'660 
720 
7*80 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
192 0 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
27C0 
2760 
2820 
2680 
2940 
3000 
3060 
3120 
3180 
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agaactagaa ataccatttg acacagctat cccattactg ggtatatacc ccaagggcta 3240 

taaatcatgc cgctataaag acacatgcac acgtaaaaaa aaaaaaaaag ggcggccgct 3300 

<210> 115 

<211> 1286 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1149) 

<223> n equals a,t,g, or c 
<400> 115 

gactgcctna cggacattgg atgaagccga agcatttaga atggtgcctg gcacacagtt 60 

ggtgcgtgat atggttaagc tttgtgtccc cacccacatc tcatcttgaa tgtgacggtt 120 

tccccggctc cctcctgccg ccatgtgaag aaggtcgttg cttrcccttc accrttccacc 180 

accatgattg ccatggatgc cccccactcc aaagcagccc tggacagcat taacgagctg * 240 

cccgagaaca rcctgctgga gcLgutcacg cacgtgcccg cccgccagct gctgctgaac * 300 

tgccgcctgg tctgcagccc ctggcgggca ccucatcgac ctcatgaccc tccggaaacg 360 

caatgcctgc gagagggctt catcaccaag gactgggacc agcccgtggc cgactggaaa 420 

atcttctatt tcctacggag cctgcatagg aacccctgcg caacccgtgt gctgaagagg 480 

atatgtttgc atggcaaatt gatctcaatg gtggggaccg ctggaaggtg gagagcctcc 540 

ctggagccca cgggacagat: t^tcctgacc ccaaagtcaa gaagtatttt gtcacatcct 600 

acgaaatgtg cctcaagtcc cagccggtgg accttgtagc cgagggctac tgggaggagc 660 

tactagacac attccggccg gacaccgtgg ttaaggactg gtttgctgcc agagccgact 720 

gtggctgcac ctaccaactc aaagtgcagc tggcctcggc tgactacttc gtgttggcct 780 

ccttcgagcc cccacctgtg accatccaac agtggaacaa tgccacatgg acagaggtct 840 

cctacacctt ctcagoctac ccccggggtig cccgctacat cctcttccag catgggggca 900 

gggacaccca gtacngggca ggctggtatg ggccccgact caccaacagc agcattgtcg 960 

tcagccccaa gatgaccagg aaccaggcct cctccgagcc tcagcctggg cagaagcatg 1020 

gacaggagga ggctgcccaa tcgccctacc gagctgttgt ccagattttc tgacagctgt 1080 

ccatcctgtg tctgggtcag ccagaggttc ctccaggcag gagctgagca tggggtgggc 1140 

aagtgaggnc cctgtaccaa gcgacttctg ccccggtLca accctaccaa gcttggggga 1200 

acttactgca catagctctg acgttttgtt gtaataaatg ctttcaggcc gggcaaaaaa 1260 

aaaaaaaaaa aaaaaaaaaa aaaaaa 1286 

<210> lie 
<211> 2189 
<212> DNA 

<213> Homo sapiens 
<400> 116 

ggaattcggc acgaggcgcc cgaggatgtg ctgctggccg ctgctcctgc tgtgggggct 60 

gctccccggg acggcggcgg ggggctcggg ccgaacctat ccgcaccgga ccctcctgga 120 

Ctcggagggc aagtactggc tgggctggag ccagcggggc agccagatcg ccttccgcct 180 

ccacg-gcgc actgcaggct acgtgggctz cggcttctcg cccaccgggg ccatggcgtc 240 

cgccgacatc gLcgtgggcg gggcggccca cgggcggccc cacccccagg attattttac 300 

aaatgcaaat agagagttga aaaaagatgc tcagcaagat taccatctag aatatgccat 360 

ggaaaatagc acacacacaa taattgaatt taccagagag ctgcatacat gtgacataaa 420 

tgacaagagt ataacggata gcactgtgag agtgatctgg gcctaccacc acgaagatgc 480 

aggagaagct ggtcc=aagt accatgactc caataggggc accaagagtt tgcggttatt 540 

gaatcctgag aaaactagtc tgc-atctac agccttacca tactttgatc tggtaaatca 600 

cgacgtcccc atcccaaaca aagatacaac atattggtge caaatgttta agattcctgt 660 

cttccaagaa aagcatcatg taataaaggt tgagccagtg atacagagag gccatgagag 72C 

tctggt.gcac cacatcctgc tctatcagtg cagcaacaac tttaacgaca gcgttcctgg 780 

aatccgggca cgaattcc^a tcaccccaac atgcccgaiy cattcctcac ctgtgaaact 840 
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gtgatttttg cctgggctat tggtggagag ggcttttctt atccacctca tgttggatta 9 CO 

tcccttggca ctccattaga tccgcattat gtgctcctag aagtccatta tgataatccc 960 

acttatgagg aaggcttaat agataattct ggactgaggt tattttacac aatggatata 1020 

ayoaaatatg at_gcLggggL gaLLyaggcc ggcctctggg tgagcctcUL ccataccatc 1080 

cctccaggga tgcctgagtt ccagtctgag ggccactgca ctttggagtg cctggaagag 1140 

gctczggaag ccgaaaagcc aagtggaatt catgtgtttg ctgttcttct ccatgctcac 1200 

ctcgctggca gaggcatcag gctgcgtcat tttcgaaaag ggaaggaaat gaaatzactt 1260 

gcctatgatg atgattttga cttcaatttc caggagtttc agtatctaaa ggaagaacaa 1320 

acaaccttac caggagataa cctaattact gagcgtcgct acaacacgaa agatagagct 1380 

gaga-gactt ggggaggact aagcaccagg agtgaaatgt gtctctcata ccttctttat 1440 

tacccaagaa ttaatcttac tcgatgtgca agtattccag acattatgga acaacr.tcag 1500 

ttcaztgggg ttaaggagat ctacagacca gtcacgacct ggcctttcat tatcaaaagt 1560 

cccaagcaat ataaaaacct ttcttzcatg gatgctatga ataagtztaa atggactaaa 1620 

aaggaaggtc tctccttcaa caagctggtc ctcagcctgc cagtgaatgt gagatgttcc 1680 

aagacagaca atgctgagtg gtcgattcca aggaatgaca gcattacctc cagatataga 1740 

aagaccctat aaagcagaac cctttggtgt gtggcacgtc ttcttcctct C^cctgcaca 1800 

gagatttctc ccatcaacttf' gcttgtttgc cttctgctac tcagctgcac gctgagcacc 1860 

aagagcttgt gatcaaaatt ccgttggacc tgacaatgtt ttctatgatc tgaacctgtc 1920 
atLLgaayta caggttaaag actgtgtcca cttugggcat gaagagtgtg gagacttttc * 1980 
ttccccattt tccctccctc ctttttcc^t tccatgttac atgagagaca tcaar.caggt ' 2040 

tctcttctct ttcttagaaa tatctgatgt tatatataca tggtcaacaa aataaaactg 2IC0 

gcctgactta agataaccat tttaaaaaat tgggctgtca tgtgggaata aaagaatLcL 2160 

ttct.ttccta aaaaaaaaaa aaaaaaaao 2189 



<210> 117 

<211> 1763 

<212> DMA 

<213> Homo sapiens 



<400> 117 

ggtggcctag agatgctgct gccgcggttg cagttgtcgc gcacgcctct gcccgccagc 60 

ccgctccacc gccgtagcgc ccgagtgtcg gggggcgcac ccgagtcggg ccatgaggcc 120 

gggaaccgcg ctacaggccg tgctgctggc cgtgctgctg gtggggctgc gggccgcgac 180 

gggtcgcctg ctgagtgggc agccagtctg ccggggaggg acacagaggc cctgctataa 240 

agtcatttac ttccatga-a cttctcgaag actgaactt^ gaggaagcca aagaagcctg 300 

caggagggga tggaggccag ctagtcagca tcgagttcr.g aagatgaaca gaaactgata 360 

gaaaakttca ttgaaaacct cttgccatct gatggtgay- tctggawtgg gctcaggagg 420 

cgtgaggaga aacaaagcaa tagcacagct gccaggacct ttatgcttgg actgatggca 480 

gcata:caca atttaggaac tggtatgtgg atgagccgtc ccgcggcagc gaggrctgcg 540 

tggtcatgta ccatcagcca tcggcacccg ctggcatcgg aggcccctac a.gt-ccagt 600 

ggaatgatga ccggtgcaac atgaagaaca atttcatttg caaatattct gatgagaaac 660 

cagcagttcc ttctagagaa cctgaaggtq aggaaacaga gctqacaaca cctg-acttc 720 

cagaagaaac acaggaagaa gatgccaaaa aaacatttaa agaaagtaga gaagctgcct 780 

tgaatctggc ctacatccta atccccagca ttccccttct cctcctcctt gkggtcacca 840 

cagttgtatg ttgggtttgg atctgtagaa aaagaaaacg ggagcagcca gaccctagca 900 

caaagaagca acacaccatc tggccctctc ctcaccaggg aaacagcccg gacctagagg 960 

tctacaatgt catwagaaaa caaagcgaag ctgacttagc tgagacccgg ccagacctga 1020 

agaatatttc attccgagtg tgttcgggag aagccactcc cgatgacacg tcttgtgact 1080 

atgacaacat ggctgtgaac ccotcagaaa gtgggtttgt gactctggtg agcgtggaga 1140 

gtggatttgt gaccaatgac atctatgagt tctccccaga ccaaatgggg aggagtaagg 1200 

agtctggatg ggtggaaaat gaaatatatg gttattagga cata-aaaaa actgaaactg 1260 

acaacaatgg aaaagaaatg ataagcaaaa tcctcttatt ttctataagg aaaatacaca 1320 

gaaggtctat gaacaagctt agatcaggtc ctgtggatga gcatgtggtc cccacgacct 1380 

cctgttggac ccccacgttt tggctgtatc ctttatccca gccagtcatc cagctcgacc 1440 • 

ttatgagaag gtaccttgcc caggtctggc acatagtaga gtctcaataa atgtcacttg 1500 

gtr.ggr.tgta tctaactttt aagggacaga gctttacctg gcagtgataa agatgggctg 1560 

tggagcttgg aaaaccacct ctgttttcct tgctctatac agcagcacat attatcatac 1620 

agacagaaaa cccagaatct ttccaaagcc cacaratggt agcacaggtt ggcctgtgca 1680 
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tcggcaattc tcatatctgt ttttttcaaa gaataaaatc aaataaagag caggaaaaaa 1740 
aaaaaaaaaa aaaaaaactc gag 1763 

<210> 118 
<211> 1375 
<212> DNA 

<213> Homo sepisns 
<220> 

<221> SITE 
<222> <18) 

<223> n equals a,t,g, or c 
<400> 118 

Sgcgggcggc gcggaagngg qggckgcgcg gccggggcag ccatgtcgcc attgtctgcg 



ggcacgagag agccrgcggc ccgagccggg gccatggcga actgctgagc tigcgtcctag 
gcccccggcc ctacaaaatc taccgggaga gggactctga aagggccccg gccaccgtcc 
ctgagacgcc aacggcagtc actgcccccc attccagccc ctgggatacg tacratcagc 



60 



gcgcgggcgg ccctgcgggt ^tacgcggta ggcgccgcgg tgatcctggc gcagctgctg 120 

cggcgccgcc gsgggggctt cctggagcca gttytccccc cacgacctga ccgfcgrcgct 180 

atagtgacgg gagggacaga tggcattggc tattctacag cgaacatctg gcgagacttg 240 

gcatgcatgt tatcatagct. ggaaataatg acagcaaagc caaacaagtt gtaagcaaaa ' 300 

taaaagaaga aaccttgaac gacaaagtgg aatttttata ctgtgacttg gcttccatga 360 

cttccatccg gcagtttgtg cagaagttca agatgaagaa gactcctctc catgtcctga 420 

tcaacaatgc tggggtgatg atggtccctc agaggaaaac cagagatgga ttcgaagaac 480 

atttcggcct gaactaccta gggcacttcc tgctgaccaa ccttctcttg gatacgctga 540 

aagagtctgg gtcccctggc cacagtgcga gggtggtcac cgtcccctct gccacccatt 600 

acgtcgctga gctgaacatg gatgaccttc agagcagtgc ctgctactca ccccacgcag 660 

cctacgccca gagcaagctg gcccttgtcc tgttcaccta ccacccccag cggctgccgg 720 

cggctgaggg aagccacgtg accgccaacg tggtggaccc cggggtggtc aacacggacs 780 

tctacaagca cgtgttctgg gccacccgtc tggcgaagaa gcttctcggc tggttgcttt B40 

tcaagacccc cgatgaagga gcgtggactt ccatctacgc agcagtcacc ccagagctgg 900 

aaggagttgg tggccgttac ctatacaacg agaaagagac caagtccctc cacgtcacct 960 

acaaccagaa actgcagcag cagctgtggt ctaagagttg tgagatgact ggggcccttg 1020 

atgtgaccct gtgatatcct gtctcaggat agctgctgcc ccaagaaaca cattgcacct 1080 

gccaatagct. tgtgggtctg tgaagactgc ggtgtt-gag tctctcacac ccacctgccc 1140 

acagggctct gtcctctagt: r.ttgagacag ctgcctcaac ccctgcagaa cttcaagaag 1200 

ccaaataaac attttggagg aUatcaccc caagtggtct tcaaccataa actttgtgat 1260 

tccaaagtgc ccagttgtca caggtgccat aaataactac attttccaac acaaaaaaaa 1320 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaagggcg gccgc 1375 

<210> 119 
<211> 1022 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (937) 

<223> n equals a.t.g, or c 
<220> 

<221> SITE 

<222> (990) • 
<223> n equals a,t,g, or c 

<40C> 119 



60 
120 
180 
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cccgtgcect 
tctcttatta 
tgtccaaagc 
tggcctgctc 
cttggaagca 
ctttgacttc 
gtctcgaggg 
ggggacagca 
cctggtggcg 
gaaggccctc 
gcgccgggca 
gcgggggaca 
ttttatgarg 
tcatccagct 



ggagaaacat 
ctcctctccc 
ggtgccgttt 

cgaggcattg 
acacatcgga 
cggagctegc 
ggcccttccc 
gacacccticc 
cacaccctag 
atgcctgcgc 
aagctgctgg 
gctgagcccc 
tgggccgcgc 
ttgctggaac 



gctgacagca tcctggcact ggcttcagta ttctggtcca 
ttcgccttct tctacttgta caggaaaggt tacttgagtt 
tctcactatg ctgggacatt gctgctacct ctggcaggtg 
gccgctggac caacccccag taccggcagt tcatcaccat 
accagtcttc agaaaacaag aggcagcttg ccaactacaa 
cagtcgactt ccactgggaa gaacccagca gccggaagga 
gccggggtgt ggccctgctC cgcccagagc ccctgcaccg 
ccaaccgggt taagaagctg ccttgtcaga tcaccagcta 
ggcgccggat gctgtatcca ggctctgtgt acctgctgca 
tgctgcaggg ccaggcccga ctggtggaag agtgtaatgg 
cctgtgatgg caatgagatt gacaccatgt ttgtggaccg 
aggacagaag ctggcgatct gctgcgargg gaatgctggg 
ctccamgccc ctggaanctg gatattcatc ckgggtggaa 
acagggggtn ccattcccgc aaaatgaagg ctaatgccat 



240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1022 



<210> 120 
<211> 2311 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (654) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2293) 

<223> r. equals a,t,g, or c 
<400> 120 

tcgacccacg cgtccgcttc ctcatgctgc ccgctcgcct accgttcagg ctgctgagcc 60 
ttttccttcg tggatccgct cccacggcag cgngccatgg cctccgggag ccgctcctgg 120 
agaggaggtg cgctgccgcc tcctccttcc agcactcatc gactctggga cgcgagcttc 180 
cttacgaccc cgtggacacg gagggctttg gagaaggtgg tgacatgcag gagcgttttc 240 
tgttcccgga gtacatcctg gatccggagc cgcaacccac ccgcgaaaag cagctgcagg 30 0 

agctccagca acagcaggag gaggaggagc gacagaggca gcagcggcgg gaggagcggc 360 
gacagcaaaa cctacgggcc aggtcccggg agcacccggt cgtggggcac ccggacccgg 420 
cattgccgcc cagcggcgtg aactgctcgg gctgcggggc asasctgcac tgccaggacg 480 
ccggagtgcc cggctacctg ccccgagaga agttcctccg cacggcggag gcagacggcg 540 
ggctggcacg gaccgtgtgc cagcgctgct ggctgctgtc gcaccaccgg cgcgctctac 600 
gcctgcaggt gagccgcgag cagtacctgg agctggtgag cgccgcgttg cggnggcccg 660 
gcccctccct ggtgctctac atggtggacc tgctggacct gcccgacgcc ctgctgcccg 720 
acttgcccgc gctggtgggc cccaagcagc tgatcgtgct gggaaacaaa gtggacctcc 780 
tgccccagga tgctcctggc t.accggcaga ggctgcggga gcgactgtgg gaggactgtg 840 
cccgcgccgg gctcctgctg gcccctggca ccaagggcca cagcgccccg tcaaggacga 900 
gccacaggac ggggagaazc cgaatccgcc gaactggtcc cgcacagtgg tcagggacgt 960 
gcggctgatc agcgccaaga ccggctatgg agtggaagag ttgatctctg cccttcagcg 1020 
ctcctggcgc taccgtgggg acgtctactt: agtgggcgcc accaacgccg gcaaatccac 1080 
tctctttaac acgctcctgg agtccgatta ctgcactgcc aagggctccg aggccatcga 1140 
cagagccacc atctcccctt ggccaggtac tacattaaac cttctgaagt ttcctacttg 1200 
caacccaact ccttacagaa tgtttaaaag gcatcaaaga ct.taaaaaag attcaactca 1260 
agctgaagaa gatcttagtg agcaagaaca aaatcagctt aatgtcctca aaaagcatgg 1320 
ttatgtcgta ggaagagttg gaaggacatt cLLgLaLtt;a gaag'aacaga aggataacat 1380 




1440 
150C 
1560 
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agaagcaaat 
aggaatggtt 
agcttggttt 
ggcagacgct 
aaaagaacga 
actgggggca 
aacacctaat 
gaccgtccgg 
aagtgtggcc 
aggaaagata 
aacaaaatac 
cct taaaaaa 
tgcgacgcca 



attgttttgc 
ctgtttttgg 
acagtcgtgg 
ctgtaccaga 
atggcrggat 
rctgaagcag 
tttaaggaca 
ccccctctct 
tataaaacca 
aatgtatgag 
attgaatttg 
aaaaaaaaag 
tanctcgtct 



caacacagtc 
gtgctatagg 
cttccaacat 
agcatgcagg 
ttcctcctct 
tggccgacat 
gactgcatct 
tgccatatat 
agaagcctcc 
accgaccttg 
tattaaacat 
ggcggccgct 
ataggaactg 



cattgttcca 
ccgcacagat 
cctccctgcg 
tcatacgt~a 
cgttgccgaa 
caagttttcc 
ccgaggctat 
tgttaacatc 
ttcccttatg 
ttcactccag 
ataacgcata 
ctagaggatc 
9 



agaacttttg 
ttcctgcagg 
catatcacct 
ctccagattc 
gacattatgt 
tctgcaggtt 
acacctgaag 
aaaggacagc 
tacaacgtga 
atatcaactg 
aataaagctc 
caagcttacg 



tgcttaaacc 
gaaatcagtc 
ccttggacag 
caatgggtgg 
taaaagaagg 
gggtttcagt 
gaacagtttt 
gcatcaagaa 
ggaagaagaa 
tattgaacac 
ccattcttac 
tacgcgtgca 



1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2311 



<210> 121 

<211> 1286 

<212> DKA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1284) 
<223> n equals 



a/t,g, or c 



<400> 121 
gggcgcgcgg 
agacgctgac 
gc egg gage c 
gctgctcctg 
geaaaaggeg 
gggccagcag 
cctgggatcc 
tttgaggagc 
atagatcttg 
agagttttgt 
tatttcacat 
ttggaccaag 
gaaggacttt 
tgttcagatt 
attgaagaac 
ccttggaatg 
agcaaagcta 
ttattcattt 
ttcttcatag 
ttctctcagt 
aatgttaaga 
aaaaaaaaaa 



gtgaaaggcg 
cacgttcctc 
azgcgacccc 
ctgctgcagc 
cactccggca 
gagtgcctgg 
caggtcegga 
cctggacacc 
ggaaaattgc 
tcagtggctc 
tcaatggagc 
gaagccctga 
gtgaaggaat 
acccaaaagg 
taccaaaata 
gttcacttaa 
aatatgttta 
tgettcaate 
tcacattctc 
atagcatttt 
atttttttta 
aaaaaaaaaa 



catcgatgea 
tcctcggtct 
agggccccgc 
tgcccgcgcc 
gagggaggtg 
tegagaeggg 
tggattcaaa 
caactacaag 
ggagtgtaca 
actteggcte 
tgaatgttca 
aatgaattca 
tggtgctgga 
agatgettet 
aatgctttaa 
atgacatttt 
cagaccaaag 
aaaagtggtt 
tcaacctata 
taaaaaaata 
tatctgttaa 
aaanaa 



gcctgcggcg 
cctccgcccc 
cgcctccccg 
gtcgagcgcc 
gtggacctgt 
agccctgggg 
ggagaaaagg 
cagtgttcat 
tttacaaaga 
aaatgcagaa 
ggacctcttc 
acaattaata 
ttagtggatg 
actggatgga 
-tttcatttg 
aaataagttt 
tgtgatttca 
tcaatatttt 
atttggaata 
taaaagctac 
ataaaaatna 



gcctcggagc 
cagctccgcg 
cagcggctcc 
tctgagatcc 
ataatggaat 
ccaatggcat 
gggaatgtct 
ggagttcatt 
tgcgttcaaa 
atgcatgctg 
ccattgaagc 
CLcatcgcac 
ttgctatctg 
attcagtttc 
ctacctcttt 
atgtatacat 
cactgttttt 
ttttagttgg 
ttgttgtggt 
caatcttngt 
zt tccaacaa 



geggeggage 
ctgcccggca 
gcggcctcct 
cc a aggggaa 
gtgcttacaa 
teegggtaca 
gaeggaaage 
gaattatggc 
tagtgetcta 
ccagcgttgg 
tataatttat 
ttcttctgtg 
ggttggcact 
tegcatcatt 
ctttattatg 
ctgaatgaaa 
aaatctagca 
ttagaatact 
cttttgtttt 
acaatttgta 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1286 



<210> 122 

<211> 1380 

<212> DNA 

<213> Homo sapiens 

<400> 122 



cagacccgcg gggcaaaegg actggggeca agacceggga gcgcgggcgc aaaggcacca 60 
rnrltll^ agggcgccgc gcacacggcc ttgggggttc tgegggcett cgggtgcgcg 120 
tcccgcccct agccatgggg tccgcagcgt tggagatcct gggcctggtg ctgtgcctgg 180 
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^gggctgggg gggtctgatc ctggcgtgcg ggctgcccat gtggcaggtg accgccttcc 24C 
tacraccacaa ast-nrtt-rr^r-n ^^^^^ — , . . . 



300 
360 
420 



tggaccacaa catcgtgacg gcgcagacca cctggaaggg gctgtggatg tcgtgcgtgg 
tgcagagcac gggcacatgc agtgcaaagt gtacgactcg gtgctggctc tgagcaccga 
ggtgcaggcg gcgcgggcgc ccaccgtgag cgccgtgctg ctggcgttcg ttgcgctctt 420 
cgcgaccctg gcgggcgcgc agtgcaccac ctgcgtggcc ccgggcccgg ccaaggcgcg 480 
tgtggccctc acgggaggcg tgctctaccr. gttttgcggg ctgctggcgc tcgtgccact 540 
ctgctggttc gccaacattg tcgtccgcga gttttacgac ccgcctgtgc ccgtgtcgca 600 
gaagtacgag ctgggcgcac gctgtacatc ggctgggcgg ccaccgcgcc gctcatggta 660 
ggcggctgcc tcttgtgctg cggcgcctgg gtctgcaccg gccgtcccga cctcagcttc 720 
cccgtgaagt actcagcgcc gcggcggccc acggccaccg gcgactacga caagaagaac 780 
tacgtctgag ggcgctgggc acggccgggc ccctcctgcc agccacgcct gcgaggcgtt 
ggataagccr ggggagcccc gcatggaccg cggcttccgc cgggtagcgc ggcgcgcagg 
ctcctcggaa cgtccggctc tgcgccccga cgcggcccct ggatccgctc ctgcctgcgc 
ccgcagctga ccttctcctg ccactagccc ggccctgccc ttaacagacg gaatgaagtt 
tccttttctg tgcgcggcgc tgtttccata ggcagagcgg gtgtcagact gaggatttcg 
cttcccctcc aagacgctgg ggg-cttggc tgctgcctta cttcccagag gctcctgctg 
acttcggagg ggcggatgca g&gcccaggg cccccaccgg aagatgtgta cacctggtct 
ttactccatc ggcagggccc gagcccaggg accagtgact tggcccggac ctcccggtct 1260 
TalTJ^lT Mtccccagg caaggcttgt gggcaccgga gcttgagaga gggcgggagt 1320 
gggaaggcta agaatctgct tagtaaatgg ttngaactct caaaaaaaaa aaaaaaaaaa 1380 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 



<210> 123 
<211> 3793 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1102) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1132) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1199) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1228) 

<223> n equals a.t,g, or c 
<220> 

<22l> SITE 
<222> (1229) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1231) 

<223> n equals a,t,g. or c 
<220> 

<221> SITE 
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<222> (3176) 

<223> n equals a,t,g, or c 
<400> 123 

ggggcttcat acaggaaatc tattgctgtg tcaagttcca gagaaaagct tctgttcgtc 60 

caagttacta accaggctaa accacattaga cgtgaaggaa ggggrtagaa ggaagggagt 120 

gccccactgt tgatggggta agaggatcct gtactgagaa gttgaccaga gagggtctca 180 

ccatgcgcac agttccttct gtaccagtgt ggaggaaaag tactgagtga agggcagaaa 240 

aagagaaaac agaaatgctc tgcccttgga gaactgctaa cctagggcta ctgttgattt 300 

tgactatctt cttagtggcc gaagcggagg gtgctgctca accaaacaac tcattaatgc 360 

tgcaaactag caaggagaat catgctttag cctcaagcag tttatgtatg gatgaaaaac 420 

agattacaca gaactactcg aaagtactcg cagaagttaa cacttcatgg cctgtaaaga 480 

tggctacaaa tgctgtgctt tgttgccctc ctatcgcatt aagaaatttg atcataataa 540 

catgggaaat aatcctgaga ggccagcct^ cctgcacaaa agcctacaag aaagaaacaa 600 

atgagaccaa ggaaaccaac tgtactgatg agagaataac ctgggtctcc agacczgatc 660 

agaattcgga ccttcagatt cgtaccgtgg ccatcactca tgacgggtat tacagatgca 720 

taatggtaac acctgatggg a%tttccatc gtggatatca cctccaagtg ttagttacac 780 

ctgaagtgac cctgtttcaa aacaggaata gaactgcagt atgcaaggca gttgcaggga B40 

agccagctgc gcatatctcc tggatcccag agggcgattg tgccactaag caagaatact * 900 

ggagcaatgg cacagtgact gttaagagta catgccactg ggaggtccac aatgtgtcta ' 960 

ccgtgaactg ccacgtctcc catttgactg gcaacaagag tctgcacata gagctacttc 1020 

ctgttccagg tgccaaaaaa tcatcaaaat tatatattcc atatatcatc cttactatta 1080 

ttattttgac catcgtggga tncatctggt tgtngaaagt caatggctgc anaaaatata 1140 

aattgaataa accagaatct actccagttg ttgaggagga tgaaatgcag ccctatgcnt 1200 

tttacacaga gaagaacaat cctctctnng ntactacaaa caaggtgaag gcatctgagg 1260 

cattacaaag tgaagutgac acagacctcc atacttcata agttgttgga ctctagtacc 1320 

aagaaacaac aacaaacgag atacattata attactgtcc gattttctta cagttctaga 13 80 

atgaagactt atattgaaat taggtrttcc aaggttctta gaagacattt taatggattc 1440 

tcattcatac ccttgtataa ttggaatttt tgattcttag ctgctaccag ctagctctct 1500 

gaagasctga tgttattaca aagaaaatac atgcccatga ccaaatattc aaattgtgca 1560 

ggacagtaaa taatgaaaac caaatttcct caagaaataa ctgaagaagg agcaagtgtg 1620 

aacagtttct tgtgtatcct ttcagaatat tttaatgtac atatgacatg tgtatatgcc 1680 

tatggtatat gtgtcaattt atgtgtcccc tcacatatac catgcaccct atcttrgtca 1740 

aggcaccag^ gggaacaata cactgcatta ctgttctata catatgaaaa cctaataata 1800 

taagtcttag agatcatttt atatcatgac aagtagagct acctcattct ttttaatggt 1860 

tatataaaat tccattgtat agttatatca ttatttaatt aaaaacaacc ctaatgatgg 1920 

atatctagat tcttttaagt tttgtttatt zcttttaagt tttgtttgtg gtataaacaa 1980 

taccacatag aatgtttct- gtgcatatat ctctttgttt ttgagtatat ctgtaggata 2040 

acttccttga gtggaattgt caggtcaaag ggtttgtgca ttttactatz gatatatatg 2100 

ttaaattgtg tcaaatatat atgtcaaatt ccctccaaca ttgtttaaat gtgcctttcc 2160 

ctaaatttct attttaataa ctgtactatt cctgcttcta cagttgccac tttctctttt 2220 

taatcaacca gattaaatat gatgtgagat tataataoga attatactat ttaataaaaa 2280 

tggatttata ttcttggtca tgtttgtaag agagtgaatg cacgtgtgag aacactagct 2340 

tcctctgaac tcattatatc tccacagagg tgttgatact tgatgcctaa cagttttgca 2400 

gatgtgctac attggaattg tgzattttr.a tggtgtacat tctattgtga tatatttatt 2460 

gaataattaa tr.gtctattga ccatataagt ggcgaaaaat gcaccataga ggacatgggg 2520 

tatttattta caaactatga gctacataat aagcaagtgg ccatgggatg gcatgaccct 2580 

cccctccata tttccgtgga gcaaaatatt ggcaatgtct atgtaaacca ttgttaatat 2640 

catgaaatta tttt-aatta aaaacataag tctatttgct ccatagcaga aaaaacatga 2700 

gaagtttttt catcatgara gaaattgaaa caaaatatat tcattcttca atcataccat 2760 

c-gagattrt caagacagct attttgtctt ataagtatat ttttctccct ctagacattt 2820 

cagttactat ggortttgtc ctcaaaggga cytttagtct aattLtggga tgtaaagcta 2880 

atcttaatga cacttggcac atgatatttt gatcaagcca ttttgacttg accaaaaagc 2940 

agtgtccatt aggtttctgc atataaatat taccaagcaa tgttcacaat agacatcatt 3000 

acactgtcct tgaaatttat taattcttca tccaaccctg gttgagctga ggctcatagt 3060 

taggttcaag actatccgtt taaatattac tgaanaacaa agtaagacag tactatgctz 3120 

acctcttaac ttgataatgt caaaccaggc atgttaaatg acatcataga aaaganttca 3180 

agataactta tagaagttaa attatattgt acagaaaata attgtatgaa aatctctact 3240 

atggggctgg aacatggttg aacattagaa tgatataaaa aattatatat atnctccaaa 3300 
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tccacgctag acctgtcaaa tcagagaatc tagagattag aoctggcgtg tcagcaaggt 3360 

catccaggaa gcagaggctg agacggagtt aggtgtgatt acttacacag tcgatfcacat 3420 

tttacaaata acattttata tgtctcattt actgtgcttt ctccccatcc cattttgcat 3480 

cttttccttt gctttgctag atttgtcaat tttctctctc tttctgtctc tctctctttc 3540 

aatatctcta ataatttgaa agtaattcat cataactaaa tatctattgg ggttatgctt 3600 

cacttacaaa cttctgaaaa cggctttact gagatataat tgatatattt aagtgtacag 3660 

tttgttaaat tttgcacata tttaaaatgt ggactttggt aaatgttgac atagtrttac 3720 

atctgtgaaa ccatcagcat aatcaagata ataaacttgt ccatcacccc ccaaaaaaaa 3780 
aaaaaaaaaa aaa 



<210> 124 
<211> 370 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (370) 

<223> Xaa e<iuals stop translation 
<430> 124 

Met Leu Gly Ala Phe Val Trp Pro Ser Leu Leu Leu Leu Ala Ala Ala 
1 5 10 15 

Cys He Cys Leu Leu Thr Phe lie Asn Cys Ala Tyr Val Lys Trp Gly 
20 25 30 

Thr Leu Val Gin Asp He Phe Thr Tyr Ala Lys Val Leu Ala Leu He 
35 40 45 

Ala Val He Val Ala Gly He Val Arg Leu Gly Gin Gly Ala Ser Thr 
50 55 60 

His Phe Glu Asn Ser Phe Glu Gly Ser Ser Phe Ala Val Gly Asp He 
65 ™ 75 Qo 

Ala Leu Ala Leu Tyr Ser Ala Leu Phe Ser Tyr Ser Gly Trp Asp Thr 
85 90 95 

Leu Asn Tyr Val Thr Glu Glu He Lys Asn Pro Glu Arg Asn Leu Pro 
100 105 no 

Leu Ser lie Gly n e Ser Met. Pro He Val Thr He He Tyr He Leu 
115 120 125 

Thr Asn Val Ala Tyr Tyr Thr Val Leu Asp Ket Arg Asp He Leu Ala 
130 135 140 

Ser Asp Ala Val Ala Val Thr Phe Ala Asp Gin lie Phe Gly He Phe 
145 150 155 16C 

Asn Trp He He Pro Leu Ser Val Ala Leu Ser Cys Phe Gly Gly Leu 
165 170 175 

Asn Ala Ser H e Val Ala Ala Ser Arg Leu Phe Phe Val Gly Ser Arg 
180 185 190 

Glu Gly His Leu Pro Asp Ala He Cys Met He His Val Glu Arg Phe 



3793 
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195 200 205 

Thr Pro Val Pro Ser Leu Leu Phe Asn Gly He Met Ala Leu He Tyr 
210 215 220 

Leu Cys Val Glu Asp lie Phe Gin Leu He Asn Tyr Tyr Ser Phe Ser 
225 230 235 240 

Tyr Trp Phe Phe Val Gly Leu Ser He Val Gly Gin Leu Tyr Leu Arg 
245 250 255 

Trp Lys Glu Pro Asp Arg Pro Arg. Pro Leu Lys Leu Ser Val Phe Phe 
260 265 270 

Pro lie val Phe Cys Leu Cys Thr He Phe Leu Val Ala Val Pro Leu 
275 280 285 

Tyr Ser Asp Thr He Asn Ser Leu He Gly He Ala He Ala Leu Ser 
290 295 300 

Gly Leu Pro Phe Tyr Phe Leu He He Arg Val Pro Glu His Lys Arg 
305 310 315 320 

Pro Leu Tyr Leu Arg Arg Ser Trp Gly Leu Pro Gin Glv Thr Ser Arg 
325 330 335 

Sex Cys Val Cys Gin Leu Leu Gin Lys Trp He Trp Lys Met Glu Glu 
340 345 350 

Arg Cys Pro Ser Asn Gly He Pro Ser Leu Thr Lys His His Leu Glu 
^55 360 365 

Ser Xaa 
370 



<210> 125 
<211> 86 

<212> PRT ^ 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (86) 

<223> Xaa equals stop translation 
<400> 125 

Met Gly Phe Trp Cys Gly Cys Pro Phe Cys Leu Leu Val Val Leu Leu 
1 5 10 is 

Thr Asp Arg Thr Leu Scr Cys Arg Ser Val Gly Val Pro Cys Asn Val 
20 25 30 

Arg Cys Gin Cys Ala Pro Ala Gly Gly Cys Leu Pro Val Arg Leu Leu 
35 40 45 

Ala Gly Gin Gly Ser Gly Thr His Leu Arg Arg Gin Ser Ala Arg Ser 
50 55 SO 
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Gin lie Ser Ser Cys Met Leu Gly Glu Pro Leu Leu Ser Ser Lys Leu 
65 70 75 Q0 

Ser Asp Arg Asp He Xaa 
85 



<210> 126 
<21l> 44 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222* (44) 

<223> Xaa equals stop translation 
<40 0> 126 * 

Met Tyr Thr Lys Thr His Lys Phe Lys Phe Tyr Asn Phe Leu Ser Leu 
15 10 is 

Trp He Trp Lys He Phe Phe Leu Leu Phe Phe He Leu He Val Ala 
20 25 30 

Leu Ala Phe Pro He Pro Cys Leu Ser lie Phe Xaa 
35 40 



<210> 127 
<211> 319 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (264) 

<223> Xaa equals any of the naturally occurring L-anu.no acids 
<220> 

<221> SITE 
<222> (303) 

<223> Xaa equals any o£ the naturally occurring L-amino acids 
<400> 127 

Met Asn Thr Asp His Leu Arg Leu Thr Val Pro Asn Gly He Gly Ala 
1 5 10 15 

Leu Lys Leu Arg Glu Met: Glu His Tyr Phe Ser Gin Gly Leu Ser Val 
20 25 30 

Gin Leu Phe Asn Asp Gly Ser Lys Gly Lys Leu Asn His Leu Cys Gly 
35 40 45 

Ala Asp Phe Val Lys Ser His Gin Lys Pro Pro Gin Gly Met Glu He 
50 55 60 

Lys Ser Asn Glu Arg Cys Cys Ser Phe Asp Gly Asp Ala Asp Arg He 
65 ™ 75 8C 
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Val Tyr Tyr Tyr His Asp Ala Asp Gly His Phe His Leu Tie Asp Gly 
85 90 95 • 

Asp Lys He Ala Thr Leu He Ser Ser Phe Leu Lys Glu Leu Leu Val 
100 105 110 

Glu He Gly Glu Ser Leu Asn He Gly Val Val Gin Thr Ala Tyr Ala 
115 120 125 

Asn Gly Ser Ser Thr Arg Tyr Leu Glu Glu Val Met Lys Val Pro Val 
130 135 140 

Tyr Cys Thr Lys Thr Gly Val Lys His Leu His His Lys Ala Gin Glu 
145 150 155 160 

Phe Asp He Gly Val Tyr Phe Glu Ala Asn Gly His Gly Thr Ala Leu 
165 / 170 175 

Phe ser Thr Ala Val Glu Met Lys He Lys Gin Ser Ala Glu Gin Leu 
180 185 190 

GZ.u Asp Lys Lys Arg Lys Ala Ala Lys Met Leu Glu Asn He He Asp 
195 200 205 

Leu Phe Asn Gin Ala Ala Gly Asp Ala He Ser Asp Met Leu Val He 
210 215 220 

Glu Ala He Leu Ala Leu Lys Gly Leu Thr Val Gin Gin Trp Asp Ala 
225 230 235 240 

Leu Tyr Thr Asp Leu Pro Asn Arg Gin Leu Lys Val Gin Val Ala Asp 
245 250 255 

Arg Arg Val He Ser Thr Thr Xaa Ala Glu Arg Gin Ala Val Thr Pro 
260 265 270 

Pro Gly Leu Gin Glu Ala He Asn Asp Leu Val Lys Lys Tyr Lys Leu 
275 280 285 

Ser Arg Ala Phe Val Arg Pro Ser Gly Thr Glu Asp Val Val Xaa Ser 
290 295 300 

He Cys Arg Ser Arg Leu Thr Arg Lys Cys Arg Ser Pro Cys Thr 
305 . 310 315 



<210> 128 

<211> 4G 

<212> PRT 

<213> Homo sapiens 

<220> 

<22l> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 128 

tiez Asp Ket val Cys Phe Cys He Tyr Leu Gly Leu Leu Lys Phe He 
1 5 10 15 
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Ser Ala lie Phe Cys Ser Phe Ser Glu Glu Val Leu Tyr lie Ser Phe 
20 25 30 

Val Lys Cys lie Pro Lys Tyr Phe Val Glu Met Leu Leu Xaa 
35 40 45 

<210> 129 
<211> 709 
<212> PRT 
<213 > Homo sapiens 

<220> 

<221> SITE 
<222> (189) 

<223> Xaa equals any of £he naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (275) 

<223> Xaa equals any of the naturally occurring L-araino acids 
<220> 

<221> SITE 
<222> (414) 

<223> Xaa equals any of the naturally occurring L -amino acids 
<220> 

<221> SITE 
<222> (438) 

<223> Xaa equals any of the naturally occurring L-anino acids 
<220> 

<221> SITE 
<222> (641) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (643) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (696) 

<223> xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (697) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 129 

Met Ala Gly Leu Asn Cys Gly Val Ser lie Ala Leu Leu Gly Val Leu 
15 10 15 

Leu Leu Gly Ala Ala Arg Leu Pro Arg Gly Ala Glu Ala Phe Glu He 
20 25 30 
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Ala Leu Pro Arg Glu Ser Asn lie Thr Val Leu lie Lys Leu Gly THr 
35 40 45 

Pro Thr Leu Leu Ala Lys Pro Cys Tyr lie Val lie Ser Lys Arg His 
50 55 60 

lie Thr Met Leu Ser. lie Lys Ser Gly Glu Arg He Val Phe Thr Phe 
65 ™ 75 80 

Ser Cys Gin Ser Pro Glu Asn His Phe Val He Glu He Gin Lys Asn 
85 90 95 

He Asp Cys Met Ser Gly Pro Cys Pro Phe Gly Glu Val Gin Leu Gin 
100 105 110 

Pro Ser Thr Ser Leu Let?' Fro Thr Leu Asn Arg Thr Phe He Trp Asp 
115 120 125 

Val Lys Ala His Lys Ser He Gly Leu Glu Leu Gin Phe Ser He Pro 
I 30 135 140 

Arg Leu Arg Gin lie Gly Pro Gly Glu Ser Cys Pro Asp G3y Val Thr 
145 150 155 160 

His Ser He Ser Gly Arg He Asp Ala Thr Val Val Arg He Gly Thr 
165 170 175 

Phe Cys Ser Asn Gly Thr Val Ser Arg He Lys Met Xaa Glu Gly Val 
18 ° 185 190 

Lys Met Ala Leu His Leu Pro Trp Phe His Pro Arg Asn Val Ser Gly 
i95 200 205 

Phe Ser He Ala Asn Arg Ser Ser He Lys Arg Leu Cys He He Glu 
210 215 220 

Ser Val Phe Glu Gly Glu Gly Ser Ala Thr Leu Met Ser Ala Asn Tyr 
225 230 235 240 

Pro Glu Gly Phe Pro Glu Asp Glu Leu Met Thr Trp Gin Phe Val Val 
^45 250 255 

Pro Ala His Leu Arg Ala Ser Val Ser Phe Leu Asn Phe Asn Leu Ser 
260 265 270 

Asn Cys Xaa Arg Lys Glu Glu Arg Val Glu Tyr Tyr He Pro Gly Ser 
2? 5 280 2Bd 

Thr Thr Asn Pro Glu Val Phe Lys Leu Glu Asp Lys Gin ^ro Gly Asn 
290 295 3C0 

Met Ala Gly Asn Phe Asn Leu Ser Leu Gin Gly Cys Asp Gin Asp Ala 
305 31C 315 320 

Gin Ser Pro Gly He Leu Arg Leu Gin Phe Gin Val Leu Val Gin His 
325 330 335 

Pro Gin Asn Glu Sor Asn Lys He Tyr Val Val Asp Leu Ser Asn Glu 
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340 345 350 

Arg Ala Met Ser Leu Thr lie Glu Pro Arg Pro Val Lys Gin Ser Arg 
355 360 365 

Lys Phe Val Pro Gly Cys Phe Val Cys Leu Glu Ser Arg Thr Cys Ser 
370 375 380 

Ser Asn Leu Thr Leu Thr Ser Gly Ser Lys His Lys lie Ser Phe Leu 
395 390 395 400 

Cys Asp Asp Leu Thr Arg Leu Trp Met Asn Val Glu Lys Xaa lie Ser 
405 410 415 

Cys Thr Asp His Arg Tyr Cys Gin Arg Lys Ser Tyr Ser Leu Gin Val 
420 425 430 

r 

Pro Ser Asp He Leu Xaa Leu Pro Val Glu Leu His Asp Phe Ser Trp 
435 440 445 

Lys Leu Leu Val Pro Lys Asp Arg Leu Ser Leu Val Leu Val Pro Ala 
450 455 460 

Gin Lys Leu Gin Gin His Thr His Glu Lys Pro Cys Asn Thr Ser Phe 
465 470 475 430 

Ser Tyr Leu val Ala Ser Ala He Pro Ser Gin Asp Leu Tyr Phe Gly 
485 490 495 

Ser Phe Cys Pro Gly Gly Ser He Lys Gin He Gin Val Lys Gin Asn 
500 505 5io 

He Ser Val Thr Leu Arg Thr Phe Ala Pro Ser Phe Arg Gin Glu AZa 
51 5 520 525 

Ser Arg Gin Gly Leu Thr Val Ser Phe lie Pro Tyr Phe Lys Glu Glu 
530 535 540 

Gly Val Phe Thr Val Thr Pro Asp Thr Lys Ser Lys Val Tyr Leu Arg 
545 550 555 560 

Thr Pro Asn Trp Asp Arg Gly Leu Pro Ser Leu Thr Ser Val Ser Trp 
565 57C 575 

Asn He Ser Val Pro Arg Asp Gin Val Ala Cys Leu Thr Phe Phe Lys 
580 585 590 

Glu Arg Scr Gly Val Val Cys Gin Thr Gly Arg Ala Phe Met He He 
595 600 60S 

Gin Glu Gin Arg Thr Arg Ala Glu Glu He Phe Ser Leu Asp Glu Asp 
610 



615 620 



Val Leu Pro Lys Pro Ser Phe His His His Ser Phe Trp Val Asn He 

625 630 6 35 640 

Xaa Asn Xaa Ser Pro Thr Ser Gly Lys Cln Leu Asp Leu Leu Phe Ser 

645 650 655 
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Val Thr Leu Thr Pro Arg Thr Val Asp Leu Thr Val He Leu He Ala 
660 665 670 

Ala Val Gly Gly Gly Val Leu Leu Leu Ser Ala Leu Gly Leu He He 
675 680 685 

Cys Cys Val Lys Lys Lys Lys Xaa Xaa Thr Arg Gly Pro Ala Val Gly 
690 695 700 

He Tyr Asn Gly Asn 
705 

<210> 130 
<211> 415 
<212> PRT 

<213> Homo sapiens ^ 
<220> 

<221> SITE 
<222> (415} 

<223> Xaa equals stop trans lac ion 
<400> 130 

Met Thr Lys Ala Arg Leu Phe Arg Leu Trp Leu Val Leu Gly Se- Val 
15 10 15 

Phe Met He Leu Leu He He Val Tyr Trp Asp Ser Ala Gly Ala Ala 
20 25 30 

His Phe Tyr Leu His Thr Ser Phe Scr Arg Pro His Thr Gly Pro Pro 
3 5 40 45 

Leu Pro Thr Pro Gly Pro Asp Arg Asp Arg Glu Leu Thr Ala Asp Ser 
50 55 60 

Asp Val Asp Glu Phe Leu Asp Lys Phe Leu Ser Ala Gly Val Lys Gin 
65 70 75 80 

Ser Asp Leu Pro Arg Lys Glu Thr Glu Gin Pro Pro Ala Pro Gly Ser 
85 90 95 

Met Glu Glu Asn Val Arg Gly Tyr Asp Trp Ser Pro Arg Asp Ala Arg 
1°0 105 HO 

Arg Ser Pro Asp Gin Gly Arg Gin Gin Ala Glu Arg Arg Ser Val Leu 
115 120 12b 

Arg Gly Phe Cys Ala Asn Ser ser Leu Ala Phe Pro Thr Lys Glu Arg 
130 135 140 

Ala Phe Asp Asp He Pro Asn Ser Glu Leu Ser His Leu He Val Asp 
143 "-50 155 160 

Asp Arg Kis Gly Ala He Tyr Cys Tyr Val Pro Lys Val Ala Cys Thr 
165 170 175 

Asn Trp Lys Arg Val Met He Val Leu Ser Gly Ser Leu Leu His Arg 
180 185 190 



WO 00/06698 



92 



PCT/US99/17130 



Gly Ala Pro Tyr Arg Asp Pro Leu Arg lie Pro Arg Glu His Val His 
195 200 205 

Asn Ala Ser Ala His Leu Thr Phe Asn Lys Phe Trp Arg Arg Tyr Glv 
210 215 220 

Lys Leu Ser Arg His Leu Met Lys Val Lys Leu Lys Lys Tyr Thr Lys 
225 230 235 240 

Phe Leu Phe Val Arg Asp Pro Phe Val Arg Leu lie Ser Ala Phe Arg 
245 250 255 

Ser Lys Phe Glu Leu Glu Asn Glu Glu Phe Tyr Arg Lys Phe Ala Val 
260 265 270 

Pro Met Leu Arg Leu Tyr Ala Asn His Thr Ser Leu Pro Ala Ser Ala 
275 280 285 

Arg Glu Ala Phe Arg Ala Gly Leu Lys Val Ser Phe Ala Asn Phe He 
29° 295 300 

Gin Tyr Leu Leu Asp Pro His Thr Glu Lys Leu Ala Pro Phe Asn Glu 
305 310 315 320 

His Trp Arg Gin Val Tyr Arg Leu Cys His Pro Cys Gin He Asp Tyr 
325 330 335 

Asp Phe Val Gly Lys Leu Glu Thr Leu Asp Glu Asp Ala Ala Gin Leu 
340 345 350 

Leu Gin Leu Leu Gin Val Asp Arg Gin Leu Arg Phe Pro Pro Ser Tyr 
155 360 365 

Arg Asn Arg Thr Ala Ser Ser Trp Glu Glu Asp Trp Phe Ala Lys He 
370 375 380 

Pro Leu Ala Trp Arg Gin Gin Leu Tyr Lys Leu Tyr Glu Ala Asp Phe 
385 390 395 403 

Val Leu Phe Gly Tyr Pro Lys Pro Glu Asn Leu Leu Arg Asp Xaa 
4C5 410 415 

<210> 131 
<211> 242 
<212> PXT 
<213> Homo sapiens 

<400> 131 

Met Gin Leu Gly Ser Val Leu Leu Thr Acg Cys Pro Phe Trp Gly Cys 
1 5 10 15 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pro Asp He Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
35 40 45 
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Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 55 60 

Leu Gly Ala Ala Leu Gin Leu Ala He Ser Thr Tyr Ala Ala Tyr He 
65 ™ 75 80 

Gly Gly Tyr Val Kis Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 
85 go 95 

Arg Thr Val Ala He He Gly Gly Phe Leu Val Leu Ala Ser Gly Ala 
100 105 no 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
115 120 125 

Gly Gin Val Phe Leu Gly He Tyr Leu He Cys Val Ala Tyr Ser Leu 
130 jas 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
145 150 155 i 6 o 

Gly Glu Leu Met lie Gin Leu Phe Phe Val Leu Tyr Gly lie Leu Ala 
155 170 175 

Leu Ala Phe Leu Ser Gly Tyr Tyr Val Thr Leu Ala Ala Gin He Leu 
180 185 190 

Ala Val Leu Leu Pro Pro Val Met Leu Leu He Asp Gly Asn Val Ala 
195 200 205 

^ 210 ^ ^ — ^ TrP ASn Gln Met LyS ^ u 



215 220 



Leu Gly Glu Ser Val Gly He Phe Gly Thr Ala Val He Leu Ala Thr 
Asp Gly 



225 ^30 235 240 



<210> 132 
<211> 313 
<=212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> {313} 

<223> Xaa equals stop translation 



<400> 132 

Met Glu Ser Leu Tyr Asp Leu Trp Glu Phe Tyr Leu Pro Tyr Leu Tyr 
15 10 15 



Ser Cys He Ser Leu Met Gly Cys Leu Leu Leu Leu Leu Cys Thr Pro 
20 25 30 

Val Gly Leu Ser Arg Met Phe Thr Val Met Gly His Leu Leu Val Lys 
35 40 j<; 
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Pro Thr i le Leu eiu Asp Leu Asp Glu Gin lie Tyr He lie Thr Leu 
50 55 go 

Glu Glu Glu Ala Leu Gin Arg Arg Leu Asn Gly Leu Ser Ser Ser Val 
65 70 75 bo 

Glu Tyr Asn He Met Glu Leu Glu Gin Glu Leu Glu Asn Val Lys Thr 
85 ~ ~ 



90 



95 



Leu Lys Thr Lys Leu Glu Arg Arg Lys Lys Ala Ser Ala Trp Glu Arg 
100 10S 110 

Asn Leu yal Tyr Pro Ala Val Met Val Leu Leu Leu lie Glu Thr Ser 
115 120 125 

lie Ser Val Leu Leu Val # la Cys Asn lie Leu Cys Leu Leu Val Asp 
130 135 140 

Glu Thr Ala Met Pro Lys Gly Thr Arg Gly Pro Gly Ile Gly Asn Ala 
150 155 1€0 

Ser Leu Ser Thr Phe Gly Phe Val Gly Ala Ala Leu Glu He lie Leu 
165 170 175 



lie Phe Tyr Leu Met Val Ser Ser Val Val Gly Phe Tyr Ser Leu Arg 
180 195 190 

Phe Phe Gly Asn Phe Thr Pro Lys Lys Asp Asp Thr Thr Met Thr Lys 
x ^ 200 205 

He lie Gly Asn Cys Val Ser He Leu Val Leu Ser Ser Ala Leu Pro 



215 220 



Val Met Ser Arg Thr Leu Gly H e Thr Arg Phe Asp Leu L*u Gly Asp 



235 240 



Phe Gly Arg Phe Asn Trp Leu Gly Asn Phe Tyr lie Val Leu Ser Tyr 
245 



250 



255 



Asn Leu Leu Phe Ala He Val Thr Thr Leu Cys Leu Val Arg Lys 



265 270 



Phe 



Thr ser Ala Val Ar g Glu Glu Leu Phe Lys Ala Leu Gly Leu His Lys 

280 285 

Leu His Leu Pro Asn Thr Ser Arg Asp Ser Glu Thr Ala Lys Pro Ser 

295 300 
Val Asn Gly His Gin Lys Ala Leu Xaa 



305 



310 



<210> 133 
<2U> 183 
<212> PPT 

<213> Homo sapiens 



<220> 
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<221> SITE 
<222> (183) 

<223> Xaa equals stop translation 
<400> 133 

Met Met Val Cys Ser He Met Met Tyr Phe Leu Leu Gly lie Thr Leu 
1 5 io 15 

Leu Arg Ser Tyr Met Gin Ser Val Trp Thr Glu Glu Ser Gin Cys Thr 
20 25 30 

Leu Leu Asn Ala Ser He Thr Glu Thr Phe Asn Cys Ser Phe Ser Cys 
35 40 45 

. Gly Pro Asp Cys Trp Lys Leu Ser Gin Tyr Pro Cys Leu Gin Val Tyr 

Val Asn Leu Thr Ser Ser Gly Glu Lys Leu Leu Leu Tyr His Thr Glu 
65 70 75 so 

Glu Thr He Lys He Asn Gin Lys Cys Ser Tyr He Pro Lys Cys Gly 
85 90 95 

Lys Asn Phe Glu Glu Ser Met Ser Leu Val Asn Val Val Met Glu Asn 

100 105 no 

Phe Arg Lys Tyr Gin Hie Phe Ser Cys Tyr Ser Asp Pro Glu Gly Asn 
115 120 125 

Gin Lys Ser Val He Leu Thr Lys Leu Tyr Ser Ser Asn Val Leu Phe 
130 135 140 

His Ser Leu Phe Trp Pro Thr Cys Vez Met Ala Gly Gly Val Ala He 
145 150 155 160 

Val Ala Met Val Lys Leu Thr Gin Tyr Leu Ser Leu Leu Cvs Glu Arg 
165 170 175 

He Gin Arg He Asn Arg Xaa 
180 



<210> 134 
<211> 147 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (147) 

<223> Xaa equals stop translation 
<400> 134 

Met Trp Lys Leu Trp Arg Ala Glu Glu Gly Ala Ala Ala Leu Gly Gly 
1 5 io 15 

Ala Leu Phe Leu Leu Leu Phe Ala Leu Gly Val Arg Gin Leu Leu Lys 
20 25 30 
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Gin Arg Arg Pro Met Gly Phe Pro Pro Gly Pro Pro Gly Leu Pro Phe 
35 40 45 

He Gly ash lie Tyr Ser Leu Ala Ala Ser Ser Glu Leu Pro His Val 
50 55 60 

Tyr Met Arg Lys Gin Ser Gin Val Tyr Gly Glu Val Gin Pro Arg Arg 
65 70 75 80 

Ala Pro Gly Arg Glu Gly Arg Gin Ala Gly Pro Gly Trp Pro Gly Pro 
85 90 95 

Ser Trp Leu Asp Leu Trp Pro Pro Leu Gly Arg Leu Val Gly Thr Ser 
100 105 110 

Pro Cys Ala Gly Cys Pro Leu Arg Asp Thr Arg Phe Pro Cly Leu Glu 
115 f 120 125 

Gly Arg Ser Pro Arg Arg Arg Ala Pro Leu Gin Gly Glu Pro Arg Pro 
130 135 i 40 

Cys Arg Xaa 
145 



<210> 135 
<211> 122 
<212> PRr 
<213> Homo sapiens 

<400> 135 

Met Arg Val Arg He Gly Leu Thr Leu Leu Leu Cys Ala Val Leu Leu 
1 5 ic 15 

Ser Leu Ala Ser Ala Ser Ser Asp Glu Glu Gly Ser Gin Asp Glu Ser 
20 25 30 

Leu Asp Ser Lys Thr Thr Leu Thr Ser Asp Glu Ser Val Lys Asp His 
35 40 45 

Thr Thr Ala Gly Arg Val Val Ala Gly Gin lie Phe Leu Asp Ser Glu 
SO 55 60 

Glu Ser Glu Leu Glu Ser Ser lie Gin Glu Glu Glu Asp Ser Leu Lys 
65 70 75 eo 

Ser Gin Glu Gly Glu Ser Val Thr Glu Asp lie Ser Phe Leu Glu Ser 
85 90 95 

Pro Asn Pro Glu Asn Lys Asp Tyr Glu Glu Pro Lys Lys Val Arg Lys 
100 105 HO 

Pro Gly ser Leu Asp lie Phe Leu Ala Phe 
H5 120 



<210> 136 
<211> 112 
<212> PRT 
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<213> Homo sapiens 
<400> 136 

Met Ala Arg Gly Ser Leu Arg Arg Leu Leu Ara Leu Leu Val Leu Gly 

Leu Trp Leu Ala Leu Leu Arg Ser Val Ala Gly Glu Gin Ala Pro Gly 
20 25 30 

Thr Ala Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp Leu Asp Lys 
35 40 45 

Cys «et Asp cys Ser Thr Ser Cys Pro Leu Pro Ala Ala Leu Ala His 
50 55 . 60 

* Pro Trp Gly Arg Ser Glu Pro Asp Leu Arg Ala Gly Ala Ala Phe Trp 
65 70 75 80 

Leu Phe Gly Leu Glu Thr Met Pro Gin Arg Glu Lys Phe Thr Thr Pro 
85 90 95 

He Glu Glu Thr Gly Gly Glu Gly cys Pro Ala Val Ala Leu lie Gin 
100 105 no 



<210> 137 
<211> 140 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (140) 

<223> Xaa equals stop translation 



<400> 137 

Met Leu Leu Gly Pro Val Pro lie Leu His He Lys Ser Gin Leu Trp 
1 5 10 15 

Leu Leu Val Leu H e Leu Val Val Ser Gly Leu Ser Ala Gly Met Ser 



25 30 



He He Pro Thr Phe Pro Glu lie Leu Ser Cys Ala His Glu Asn Gly 



40 



Phe Glu Glu Gly Leu Ser Thr Leu Gly Leu Val Ser Gly Leu Phe Ser 



55 60 



Ala Met Trp Ser He Gly Ala phe Met Gly Pro Thr Leu Gly Gly Phe 
65 7 ° 75 BO 

Leu Tyr Glu Lys lie Gly Phe Glu Trp Ala Ala Ala lie Gin Gly Leu 
85 90 95 

Trp Ala Leu He Ser Gly Lea Ala Met Gly Leu Phe Tyr Leu Leu Glu 
1Q 0 105 no 
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Tyr Ser Arg Arg Lys Arg Ser Lys Ser Gin Asn lie Leu Ser Thr Glu 
l- 1 ^ 120 125 

Glu Glu Arg Thr Thr Leu Leu Pro Asn Glu Thr Xaa 
130 135 140 



<210> 138 
<211> 404 
<212> PRT 

<213> Homo sapiens 
<400> 138 

Met Arc Leu Gin Asp Val Tyr Met Leu Asn Val Lys Gly Leu Ala Arg 
1 5 io 15 

Gly val Phe Gin Arg Val Thr Gly Ser Ala He Thr Asp Leu Tyr Ser 
20 25 30 

Pro Lys Arg Leu Phe Ser Leu Thr Gly Asp Asp Cys Phe Gin Val Gly 
35 40 45 

Lys Val Ala Tyr Asp Met Gly Asp Tyr Tyr His Ala He Pro Trp Leu 
50 55 60 

Glu Glu Ala Val Ser Leu Phe Arg Gly Ser Tyr Gly Glu Trp Lys Thr 
65 70 75 80 

Glu Asp Glu Ala Ser Leu Glu Asp Ala Leu Asp His Leu Ala Phe Ala 
85 go 95 

Tyr Phe Arg Ala Gly Asn Val Ser Cys Ala Leu Ser Leu Ser Arg Glu 
100 105 no 

Phe Leu Leu Tyr Ser Pro Asp Asn Lys Arg Met Ala Arg Asn Val Leu 
115 120 125 

Lys Tyr Glu Arg Leu Leu Ala Glu Ser Pro Asn His Val Val Ala Glu 
130 135 140 

Ala Val He Gin Arg Pro Asn He Pro His Leu Gin Thr Arc Asp Thr 
145 150 155 " 160 

Tyr Glu Gly Leu Cys Gin Thr Leu Gly Ser Gin Pro Thr Leu Tyr Gin 
165 170 175 

He Pro ser Leu Tyr Cys Ser Tyr Glu Thr Asn Ser Asn Ala Tyr Leu 
180 185 190 

Leu Leu Gin Pro He Arg Lys Glu Val lie His Leu Glu Pro Tyr He 
195 200 205 

Ala Leu Tyr His Asp Phe Val Ser Asp Sec Glu Ala Gin Lys He Arc 
210 215 220 

Glu Leu Ala Glu Pro Trp Leu Gin Arg Ser Val Val Ala Ser Gly Glu 
225 230 235 240 
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Lys Gin Leu Gin Val Glu Tyr Arg He Ser Lys Ser Ala Trp Leu Lys 
245 250 255 

Asp Thr val Asp Leu Lys Leu Val Thr Leu Asn His Arg lie Ala Ala 
260 265 270 

Leu Thr Gly Leu Asp Val Arg Pro Pro Tyr Ala Glu Tyr Leu Gin Val 
275 280 285 

Val Asn Tyr Gly He Gly Gly His Tyr Glu Pro His Phe Asp His Ala 
290 295 300 

Thr Ser Pro Ser Ser Pro Leu Tyr Arg Met L ys Ser Gly Asn ^ Val 
305 310 3is 3 2 o 

Ala Thr Phe Met He Tyr Leu Ser Ser Vol Glu Ala Gly Gly Ala Thr 
325 ^ 330 335 

Ala Phe He Tyr Ala Asn Leu Ser Val Pro Val Val Arg Asn Ala Ala 
34 ° 345 350 

Leu Phe Trp Trp Asn Leu His Arg Ser Gly Glu Gly Asp Ser Asp Thr 
355 360 365 

Leu His Ala Gly Cys Pro Val Leu Val Gly Asp Lys Trp Val Ala Asn 
370 375 380 

Lys Trp He His Glu Tyr Gly Gin Glu Phe Arg Arg Pro Cys Ser Ser 
385 390 395 400 

Ser Pro Glu Asp 



<210> 139 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (96) 

<22 3> Xaa equals stop translation 
<400> 139 

Met Lys Ala Pro His Thr Gly Val Leu His Leu Gly Ser Val Trp Val 



10 15 



Phe Leu Gly Pro Phe Leu Leu Gly Val Gly Tyr Thr Leu Thr Phe 



20 



25 30 



Asn 



Pre Leu Ser Gly Cys Met Ser Thr 



35 



Thr Val Arg Trp Leu Asn Ser Asn He 
40 45 

Thr Ala Asn Axg Thr Leu Ser Arg Ser Val Cys His Val Thr Pro Leu 
50 55 60 

His Arg Ser Leu Ser Pro His Asp Gly Glu Tyr leu Arc, Gin Met Leu 
65 70 75 BC 
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Lgu Asn Ser Ser Ser Arg Ala Gly Glu Ala Gly Ser Tro Gly Tyr Xaa 
85 90 * 95 



<210> 140 
<211> 240 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> <240i 

<223> Xaa equals stop translation 
<400> 140 

Met Gly Ser Cys Ala Arg Leu Leu Leu Leu Trp Gly Cys Thr Val Val 
1 5 10 15 

Ala Ala Gly Leu Ser Gly Val Ala Gly Val Ser Ser Arg Cys Glu Lys 
20 25 30 

Ala Cys Asn Pro Arg Met Gly Asn Leu Ala Leu Gly Arg Lys Leu Trp 
35 40 45 

Ala Asp Thr Thr Cys Gly Gin Asn Ala Thr Glu Leu Tyr Cys Phe Tyr 
50 55 60 

Ser Glu Asn Thr Asp Leu Thr Cys Arg Gin Pro Lys Cys Asp Lys Cys 
6j 70 75 80 

Asn Ala Ala Tyr Pro His Leu Ala His Leu Pro Ser Ala Met Ala Asp 
85 go 95 

Ser Ser Phe Arg Phe Pro Arg Thr Trp Trp Gin Ser Ala Glu Asp Val 
100 105 no 

His Arg Glu Lys lie Gin Leu Asp Leu Glu Ala Glu Phe Tyr Phe Thr 
115 120 125 

His Leu He Val Met Phe Lys Ser Pro Arg ?ro Ala Ala Met Val Leu 
130 135 140 

Asp Arg Ser Gin Asp Phe Gly Lys Thr Trp Lys Pro Tyr Lys Tyr Phe 
145 150 155 160 

Ala Thr Asn Cys ser Ala Thr Phe Gly Leu Glu Asp Asp Val Val Lys 
165 170 175 

Lys Gly Ala He Cy S Thr Ser Lys Tyr Ser Ser Pro Pho Pro Cys Thr 
!80 185 190 

Gly Arg Lys Val He Phe Lys Ala Leu Ser Pro Pro Tyr Aso Thr Slu 
195 200 205 

Asn Pro Tyr Ser Ala Lys Val Gin Glu Gin Leu Lys lie Thr Asn Leu 



J 
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210 215 220 

Pro Arg Ala Ala Ala Glu Thr Thr Val Leu Ser Leu Ser Glu Lys Xaa 
225 230 235 240 



<210> 141 
<211> 54 
<212> PRT 

<213> Homo sapiens 

<220> 
'<221> SITE 

<222> (54) ^ 

<223> xaa equals stop translation 

<400> 141 

Met Met lie Ser Gly Leu Lys Leu Leu Val Leu Phe Leu Lys Phe Ala 
1 " S 10 is 

Pro Glu Asn Tyr Cys Leu Ser Thr Glu Thr Leu Gin Met Pro Asn Arg 
20 25 30 

His Leu Arg Leu Ser Lys Ala Thr Cys Tyr Leu Met Lys Cys Leu Leu 
35 40 45 

Pro Ser Tyr Phe Glu Xaa 
50 



<210> 142 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals stop translation 
<400> 142 

Ket Arg Ser Leu He Ser Ser His Pro Cys Gin His Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Phe Leu He Leu Ala He Leu Val Asp Val Lys Trp 
20 25 30 

Tyr Leu Val Leu Phe He Cys He Ser Leu Met Thr Ser Asp Val Glu 
35 40 45 

His Leu Phe Met Cys Leu Leu Ala He Arg He Ser Ser Trp Arg Asn 
50 55 60 

Val Tyr Xaa 
65 
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<210> 143 

<211> 108 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
^222> (58) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 143 

Leu ser Val Ala Gly 
15 

Arg Leu Pro Arg His 
30 

Leu Leu Lys Ala Xaa 
45 

Phe Pro Gly Cys Pro 
60 

Met Gly Asn Gly Lys 
80 

fro Gin Lys Arg Phe 
95 

Phe 



Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Glu 
1 5 io 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin 
20 25 

He Cys Ser Gin Arg Asn Pro Pro Gly Arg Cys 
35 40 

Leu Gin Thr Thr Trp Gly Xaa Pro Asp Xaa Gin 
50 55 

His Pro Xaa Arg Val Thr Leu Asn Ala Arg Gin 
65 70 75 

Glu Lys Lys Ala Ala Asp Leu Lys Leu Lys Phe 
85 90 

Tyr Leu Ser Ala Phe Ser Glu Arg He Lys Ala 
1 nn 



<210> 144 
<211> 84 
<212> pp/r 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (84) 

<223> xaa equals step translation 
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<400> 144 

Met Ala Ser Val Gly Thr Thr Leu Val Ser Pro Leu Leu Cys Leu Leu 
1 5 10 15 

lie Pro Thr Arg Val Ser Asp Pro Trp Leu Gin Asn Thr Pro Leu His 
20 25 30 

Pro Trp Lys Thr lie Thr lie lie Asp Tyx Tyr Leu Ser Leu Gly Phe 
35 40 45 

Leu Gly Trp Thr Gly Leu Ser Trp Val Val His Phe Gly Ala Ser Ala 
50 55 60 

Val Met Gly Arg Gin Trp Leu Gly Ser Leu Gin Arg Leu Pro Cys lie 
65 70 75 80 

Ser Gly Ser Xaa ^ 



<210> 145 
<211> 166 
<212> PRT 

< 213> Homo sapiens 
<400> 145 

Met Gly Ser Arg Phe Leu Leu Val Leu Leu Ser Gly Leu Thr Val Leu 
1 5 10 is 

Leu Ala Leu Pro Gly Ser Glu Ala Lys Asn Ser Gly Ala Ser Cys Pro 
20 25 30 

Pro cys Pro Lys Tyr Ala Ser Cys His Asn Ser Thr His Cys Thr Cys 
35 40 45 

Glu Asp Gly Phe Arg Ala Arg Ser Gly Arg Thr Tyr Phe His Asp Ser 
50 55 60 

Ser Glu Lys cys Glu Asp lie Asn Glu Cys Glu Thr Gly Leu Ala Lys 
65 70 75 80 

Cys Lys Tyr Lys Ala Tyr Cys Arg Asr. Lys Val Gly Gly Tyr He Cys 
85 90 95 

Ser Cys Leu Val Lys Tyr Thr Leu Phe Asn Phe Leu Ala Gly He He 
100 105 HO 

Asp Tyr Asp His Pro Asp Cys Tyr Glu Asn Asn Ser Gin Gly Thr Thr 
115 120 125 

Gin Ser Asn Val Asp lie Trp Val Ser Gly Val Lys Pro Gly Phe Gly 

130 



135 

Lys Gin Leu Val Arg He Thr Met Pro Phe Ser Tyr Pro Asn He Asn 

160 



145 150 



Met Ser Ser Cys Asp Phe 
165 
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<210> 146 

<2ll> 70 

<212> PRT 

<213> Homo sapiens 

<400> 146 

Met Lys Pro Lys His Leu Glu Trp Cys Leu Ala His Ser Trp Cys Val 
1 5 10 15 

He Trp Leu Ser Phe Val Ser Pro Pro Thr Ser His Leu Glu Cys Asp 
20 25 30 

Gly Phe Pro Gly Ser Leu Leu Pro Pro Cys Glu Glu Gly Arg Cys Phe 
35 40 45 

Pro Phe Thr Phe His His ,Hig Asp Cys His Gly Cys Ser Pro Leu Gin 
50 55 60 

Ser Ser Pro Gly Gin His 
65 7 0 



<210> 147 
<211> 412 
<212> PRT 

<213> Homo sapiens 
<400> 147 

Met Cys Cys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 
1 5 10 15 

Ala Ala Gly Gly Ser Gly Arg Thr Tyr Pro His Arg Thr Leu Leu Asp 
20 25 3C 

Ser Glu Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin He 
35 40 45 

Ala Phe Arg Leu Gin Val Arg Thr Ala Gly Tyr Val Gly Phe Gly Phe 
50 55 60 

Ser Pro Thr Gly Ala Met Ala Ser Ala Asp He Val Val Gly Gly Val 



70 75 



80 



Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arc 
85 90 95 

Glu Leu Lys Lys Asp Ala Gin Gin Asp Tyr His Leu Glu Tyr Ala Met 

100 105 no 

Glu Asn Ser Thr His Thr He He Glu Phe Thr Arg Glu Leu His Thr 



115 



120 



cys Asp lie Asn Asp Lys Ser He Thr Asp Ser Thr Val Arg Val Tie 
1JU 135 140 

Trp Ala Tyr His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His 

15 °, 155 i 60 
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Asp Ser Asn Arg Gly Thr lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
165 170 175 

Thr Ser Val Leu Ser Thr Ala Leu Pro Tyr Phe Asp Leu Val Asn Gin 
180 185 190 

Asp Val Pro He Pro Asn Lys Asp Thr Thr Tyr Trp Cys Gin Met Phe 
195 200 205 

Lys He Pro Val Phe Gin Glu Lys His His Val He Lys Val Glu Pro 
210 215 220 

Val He Gin Arg Gly His Glu Ser Leu Val His His He Leu Leu Tyr 
225 ' 230 235 240 

Gin Cys Ser Asn Asn Phe Asn Asp Ser Val Leu Glu Ser Gly His Glu 
245 250 255 



Cys Tyr Kis Pro Asn Met Fro Asp Ala Phe Leu Thr Cys Glu Thr Val 
260 265 270 

He Phe Ala Trp Ala He Gly Gly Glu Gly Phe Scr Tyr Pro Pro His 
275 280 285 

val Gly Leu Ser Leu Gly Thr Pro Leu Asp Pro His Tyr Val Leu Leu 
290 295 300 

Glu Val His Tyr Asp Asn Pro Thr Tyr Glu Glu Gly Leu He Asp Asn 
305 310 



315 



320 



Ser Gly Leu Arg Leu Phe Tyr Thr Met Asp lie Arg Lys Tyr Asp Ala 
325 330 335 

Gly Val He Glu Ala Gly Leu Trp Val Ser Leu Phe His Thr He Pro 
340 345 350 

Pro Gly Met Pro Glu Phe Gin Ser Glu Gly His Cys Thr Leu Glu Cys 
355 360 365 

Leu Glu Glu Leu Trp Lys Pro Lys Ser Gin Val Glu Phe Met Cys Leu 
370 375 3B0 

Leu Phe Phe Ser Met Leu Thr Trp Leu Ala Glu His Gin Ala Ala Ser 
385 390 395 4 00 

Phe Ser Lys Arg Glu Gly Asn Glu He Thr Cys Leu 
405 410 



<210> 148 
<211> 85 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (85) 

<223> Xaa equals stop translation 
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<400> 148 

Met Asn Val Phe Leu Pro Pro Ala Leu Gly Thx Trp Cly Val Ala Arg 
15 10 15 

Phe Phe Pro His Leu Val Pro Glu Arg Trp Cys Leu Val Phe Cys Cys 
20 25 30 

Trp lie Phe Phe Phe Phe Phe Phe Phe Cys Thr Lys Val Ala Thr Arg 
35 40 45 

Ser Val Leu Gly Asp Gin Ala Gly Leu Gly Val Gly Gly Pro His Leu 
50 55 60 

Pro Leu Pro Gly Ser His Ser Val Ser Val Pro Glu Lys Thr lie Phe 
65 70 75 80 

Ser Leu Lys Gin Xaa ;■ 
85 



<210> 149 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> SITE 
<222> (154) 

<223> Xaa equals stop translation 
<400> 149 

Met Gly Arg Leu Pro Leu Leu Arg Arg Val Leu Lys Gly Leu Gin Leu 
1 5 10 15 

Leu Leu Ser Leu Leu Ala Phe lie Cys Glu Glu Val Val Ser Gin Cys 
20 25 30 

Thr Leu Cys Gly Gly Leu Tyr Phe Phe Glu Phe Val Ser Cys Ser Ala 
35 40 45 

Phe Leu Leu Ser Leu Leu lie Leu lie Val Tyr Cys Thr Pro Phe Tyr 
50 55 60 

Glu Arg Val Asp Thr Thr Lys Val Lys Ser Ser Asp Phe Tyr lie Thr 
65 70 75 80 

Leu Gly Thr Gly Cys Val Phe Leu Leu Ala Ser He lie Phe Val Ser 
85 90 95 

Thr His Asp Arg Thr Ser Ala Glu He Ala Ala He Val Phe Gly Phe 
100 105 HO 

He Ala ser Phe Met Phe Leu Leu Asp Phe He Thr Met Leu Tyr Glu 
115 120 125 

Lys Arg Gin Glu Ser Gin Leu Arg Lye Pro Glu Aan Thr Thr Arg Ala 
130 135 140 

Glu Ala Leu Thr Glu Pro Leu Asn Ala Xaa . 
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145 150 



<210> 150 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (130) 

<223> Xaa equals stop translation 
<400> 150 

Met Arg Gly His Leu Ala Gly Phe Pro Ala Leu Ser Gly Leu Ala Ser 
1 5 10 15 

$* 

Val Cys Leu Trp Ala Thr Phe Ser Ala Gin Leu Pro Gly Pro Val Ala 
20 25 30 

Ala Thr Ser Trp Thr Pro Ala Pro Leu Gly Cys Ser Ala Ala Arg Ser 
35 40 45 

Gly Pro Glu Lys Arg- Leu Gly Thr Ala Ala Pro Gly Ser Ala Ala Ser 
50 55 60 

Leu Ala Gin Ala Gly Pro Gly Ala Pro Cys Arg val Leu Pro Val Aso 
65 7 0 75 83 

Pro Ala Pro Ala Ala Leu Asn Val Arg Glu Pro Gly Trp Leu Gly Gly 
85 90 95 

Leu Phe Asp Gly Ala Leu Leu Gin Val Leu Leu Asn Phe Leu Arg Lys 
100 105 110 

Ser Thr Asp Val Leu Met Asp Thr Arg Glu Ala Glu Ser Leu Glu Val 
US 120 125 

Glu Xaa 
130 



<210> 151 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals step translation 
<400> 151 

Met Leu Phe Trp Ala 'Vyxr Pro lie Cys Val Phe lie Asp Ser Leu Ser 
1 5 10 15 

Cys Gin Pro Cys Leu Trp Ser Thr Gly Ala Thr Ser His Phe Asn Ser 
20 25 30 
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Pro Thr Thr Ser Pro Leu The Thr Leu Phe Met Pro Cys Ala Leu Ala 
35 40 45 

Pro Asn Pro Phe Thr Gin Leu Gly Lys Leu Asp Asp Arg Xaa 
50 55 60 



<210> 152 
<211> 225 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (225) 

<223> Xaa equals atop translation 
<400> 152 

Met Gly lie Phe Pro Gly lie He Leu He Phe Leu Arg Val Lys Phe 
3 5 io 15 

Ala Thr Ala Ala val He Val Ser Gly His Gin Lys Ser Thr Thr Val 
20 25 30 

Ser His Glu Met Ser Gly Leu Asn Trp Lys Pro Phe Val Tyr Gly Gly 
35 40 45 

Leu Ala ser He Val Ala Glu Phe Gly Thr Phe Pro Val Asp Leu Thr 
50 55 60 

Lys Thr Arg Leu Gin Val Gin Gly Gin Ser He Asp Ala Arg Phe Lys 
65 70 ' 75 80 

Glu He Lys Tyr Arg Gly Met Phe His Ala Leu Phe Arg He Cys Lys 
65 90 95 

Glu Glu Gly Val Leu Ala Leu Tyr Ser Gly He Ala Pro Ala Leu Leu 
100 105 HO 

Arg Gin Ala Ser Tyr Gly Thr He Lys He Gly He Tyr Gin Ser Leu 
11 5 120 125 

Lys Arg Leu Phe Val Glu Arg Leu Glu Asp Glu Thr Leu Leu He Asn 
13 ° 135 140 

Met He Cyo Gly Val Val Ser Gly Val He Ser Ser Thr He Ala Asn 
145 ISO 155 160 

Pro Thr Asp Val Leu Lys He Arg Met Gin Ala Gin Gly Ser Leu Phe 
165 170 175 

Gin Gly ser Met He Gly Ser Phe He Asp He Tyr Gin Gin Glu Gly 
180 185 190 

Thr Arg Gly Leu Trp Arg VaL Ser Thr Leu Phe Leu Leu Leu Ser Tyr 
195 200 205 

Thr Leu Ser Ser Tyr Asn Leu Gin Arg He Phe Phe Tyr He Lys Thr 
21 0 215 220 
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Xaa 
225 



<210> 153 

<211> 69 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (69) 

<223> Xaa equals stop translation 



<400> 153 

Met Leu Met Leu Leu Thr leu Leu Val Leu Gly Met Val Trp Val Ala 
1 5 10 



15 



Ser Ala Zle Val Asp Lys Asn Lys Ala Asn Arg Glu Ser Leu "Vr Asp 
20 25 30 " 

Phe Trp Glu Tyr Tyr Leu Pro Tyr Leu Tyr Ser Cys lie Ser Phe Leu 
35 40 45 

Gly Val Leu Leu Leu Leu Ala Ala Gly Arg Pro Gly Gly Ala Ala Val 
50 55 60 



Leu Leu Ser Leu Xaa 
65 



<210> 154 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals stop translation 
<400> 154 

Met Tyr Gly val Cys Leu Cys Val lie Val Cys Val Ser Gly Val Ser 
1 5 io 15 

Leu Cys Leu Tyr Val Trp Gly Val Ser Val Cys Asp Cys Val Ser Val 
20 25 30 

Phe Met Cys Val Cys Leu Cys Val He Phe Cys Val Tyr Gly Lys Pro 
3 5 40 45 

Arg Thr Glu His Tyr His Ser Pro His Leu Ala Lys Gin Lys Ala Phe 
. 50 55 60 

Arg Glu Met Cys Gly Arg His Asp Val Ser Ala Ala Gly n« Phe Gin 
65 7 ° 75 80 

Ser Tyr Val Xaa 
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<210> 155 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (61) 

<223> Xaa equals stop translation 
<400> 155 

Met His Val Leu Leu Phe Ser Phe Leu He Pro Phe Leu Leu Leu Ser 
1 5 10 15 

Pro Val Gly Val Thr Cys Asn Ser His Met Leu Glu Arg Gin Val Ser 
20 25 30 

Trp Leu Lys Lys Arg Ser Thr Gin Ala Ser Gin Gin Phe Asn Lys Phe 
35 40 45 

Leu Arg Gly He Ser Asn Val Gly Arg He Val He Xaa 
50 55 60 



<210> 156 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (B4) 

<223> Xaa equals stop translation 
<400> 156 

Met Cys Leu Leu val Glu Tyr Ser Leu Met He Leu Thr lie He Pro 
1 5 io 15 

Ser Leu Leu Ser Phe Val Leu Cys Leu Lys Gly He Lys His Gly Asn 
20 25 30 

Tyr He Phe Gin Thr Pro Leu Pro Glu Gly Tyr Gly Trti Tie Ser Ala 
35 co 45 

Met Ser Gly Leu Cys He Lys Phe Gly Arg Arg Lys Arg Arg Lys Thr 
50 55 60 

Trp Leu Leu Gin Val Gly Thr Leu Ala Thr He Asp Thr Glu Phe Ala 
65 70 75 80 

Arg Ser Cys Xaa 



<210> 157 
<211> 162 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (162) 

<223> Xaa equals stop translation. 
<400> 157 

Met Ala Leu Ser Leu Thr Leu Cys ?he Val Met Phe Trp Thr Pro Asn 
1 5 10 15 

Val Ser Glu Lys lie Leu lie Asp lie He Gly Val Asp Phe Ala Phe 
20 25 . 30 

Ala Glu Leu Cys Val Val Pro Leu Arg He Phe Ser Phe Phe Pro Val 
35 £ 40 45 

Pro Val Thr Val Arg Ala His Leu Thr Gly Trp Leu Met Thr Leu Lys 
50 55 60 

Lys Thr Phe Val Leu Ala Pro Ser Ser Val Leu Arg lie He Val Leu 
65 70 75 80 

He Ala Ser Leu Val Val Leu Pro Tyr Leu Gly Val His Gly Ala Thr 
85 90 95 

Leu Gly Val Gly Ser Leu Leu Ala Gly Phe val Gly Glu Ser Thr Met 
100 105 HO 

Val Ala Tie Ala Ala Cys Tyr Val Tyr Arg Lys Gin Lys Lys Lys Met 
115 120 125 

Glu Asn Glu Ser Ala Thr Glu Gly Glu Asp Ser Ala Met Thr Asp Met 
130 135 140 

Pro Pro Thr Glu Glu Val Thr Asp He Val Glu Met Arg Glu Glu Asn 
145 150 155 160 

Glu Xaa 



<210> 158 
<211> 146 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (96) 

<223> Xaa equals any of the naturally occurring L-aroino acids 
<220> 

<221> SITE 
<222> (107) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<220> 
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<221> SITE 
<222> <111) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> U15J 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (122) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (132) I 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 158 

Met Glu Pro Gin Leu Gly Pro Glu Ala Ala Ala Leu Arg Pro Gly Trp 
1 5 10 is 

Leu Ala Leu Leu Leu Trp Val Ser Ala Leu Ser Cys Ser Phe Ser Leu 
20 25 30 

Pro Ala Ser Ser Leu Ser Ser Leu Val Fro Gin Val Arg Thr Ser Tyr 
35 40 45 

Asn Phe Gly Arg Thr Phe Leu Gly Leu Asp Lys Cys Asn Ala Cys lie 
50 55 60 - 

Gly Thr ser He Cys Lys Lys Phe Phe Lys Glu Glu He Arg Ser Asp 
65 70 75 AO 

Asn Trp Leu Ala Ser His X,eu Gly Thr Ala Ser Arg Phe Pro Leu Xaa 
85 go 95 

Ser Tyr Pro Cys Lys Leu Leu Gin Met He Xaa L.ys He Trp Xaa Pro 
100 105 110 

Cys Gly Xaa Leu Leu Thr Gly Gin Gin Xaa Ser Asn Glu He Ser Lys 
1*5 120 125 

Gin Glu He Xaa Cys Leu Leu His Pro Pro Pro Lys Asn Leu His He 
130 135 140 

Asp val 
145 



<210> 159 
<211> 143 
<212> PRT 

<213> Hono sapiens 
<220> 

<221> SITE 
<222> (143) 
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<223> Xaa equals stop translation 
<400> 159 

Met Trp Trp Ala Val Met Gly Gly Val He Gly Ser Trp Leu Ser Pro 
1 5 10 is 

Leu Ser He Ala Glu Cys Cys His Asp Leu Trp Thr Ser Gin Ser Cys 
20 25 30 

Glu His Ala Gly Ala Leu Cys Gly Asp Leu Leu Cys Ala Cys Arc Lys 
35 4Q 45 

Val Gly Val Trp Cys Ala Leu Gin Gin His Trp Trp Asn Arg Cys Val 
50 55 60 

Cys Pro His Ala Val He Arg Val His Cys Thr Gly Ala Ser Tyr Thr 
65 70 f 75 60 

Leu Gin Lys He Cys Ser Cys Asn Pro Lys Phe Met Gly Arg His Pro 
65 90 95 

His Arg Trp Gin Gin He Arg Lys Cys Ser Gin Pro Val Leu Arg Gly 
100 105 110 

Ser Arg Ala Ala Phe He Trp Val Arg Leu Ala Ala Leu Asn Phe He 
115 120 125 

Ser Ser Phe Arg Cys He ser Leu He Ser Tyr Ser Ala Phe Xaa 
1-30 135 140 



<210> 160 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 160 

Met Lys Val Ser Asp Phe Asn Phe Leu He Phe Leu He Phe Ala Leu 
1 5 io is 

Phe Leu Thr Leu Glu Ala Phe Leu Lys Phe Thr Lys Arg Val Leu Ala 
20 25 30 

Val Val Gly Asn Leu Pro G1j Pro Pro He lie Lys Thr He Gly Phe 
35 . 40 45 

Leu Tyr Xaa 
50 



<210> 161 
<211> 65 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 161 

Met Val Trp ser Ala Ala Pro Ala Pro Cys Cys Leu Leu Gly Val Leu 
15 10 15 

Gly Leu Val Gin Val Leu Gly Ala Gin Ala Val Gly Pro Trp Thr Ala 
20 25 30 

Ser Ala Cys Leu Gly Ala Ala Gin Ala Gin Pro Cys Arg; Pro Cys Lys 
- 35 40 45 

Glu Ser Ser Leu Arg Leuf Phe Ser Ala Ser Ala Pro Ser Net Thr His 
SO 55 60 

Xaa 
65 



<210> 162 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (59) 

<223> Xaa equals stop translation 
<400> 162 

Met Glu Lys Tyr Cys Leu Gly Asn Asn Met Leu Ser Arg Phe Cys Leu 
1 5 10 15 

Phe Leu lie Met Leu Leu His lie Leu Leu Phe Leu Val lie Phe lie 
20 25 30 

Gin Arg His Thr Val Val Ser Leu Ser Lys His His Pro Phe Val Pro 
35 40 45 

Thr Asn Gly Ser Lys Ser Tyr Ser Ser Phe Xaa 
50 5S 



<210> 163 

<211> 374 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (84) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
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<222> (112) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 163 

Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu 
15 10 15 

Val Gly Le» Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin Pro Val 
20 25 .30 

Cya Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val lie Tyr Phe His 
35 40 45 

Asp Thr Ser Arg Arg Leu Asn Phe Glu Glu Ala Lys Glu Ala Cys Arg 
50 55 60 

Arg Asp Gly Gly Gin Leu^Val Ser lie Glu Ser Glu Asp Glu Gin Lys 
65 70 75 B0 

Leu He Glu Xaa Phe He Glu Asn Leu Leu Pro Ser Asp Gly Asp Phe 
85 go 95 

Trp He Gly Leu Arg Arg Arg Glu Glu Lys Gin Ser Asn Ser Thr Xaa 
100 105 110 

Cys Gin Asp Leu Tyr Ala Trp Thr Asp Gly Ser He Ser Gin Phe Arg 
115 120 125 

Asn Trp Tyr Val Asp Glu Pro Ser Cys Gly Ser Glu Val Cys Val Val 
130 135 140 

Met Tyr His Gin Pro Ser Ala Pro Ala Gly He Gly Gly Pro Tyr Met 
145 150 155 160 

Phe Gin Trp Asn Asp Asp Arg Cys Asn Met Lys Asn Asn Phe He Cys 
165 170 175 

Lys Tyr Ser Asp Glu Lys Pro Ala Val Pro Ser Arg Glu Ala Glu Gly 
180 iQ5 190 

Glu Glu Thr Glu Leu Thr Thr Pro Val Leu Pro Glu Glu Thr Gin Glu 
195 200 205 

Glu Asp Ala Lys Lys Thr Phe Lys Glu Ser Arg Glu Ala Ala Leu Asn 



210 



215 220 



Leu Ala Tyr He Leu He Pro Ser He Pro Leu Leu Leu Leu Leu Val 

225 230 235 240 

Val Thr Thr Val Val Cys Trp Val Trp He Cys Arg Lys Arg Lys Arg 
245 250 255 

Glu Gin Pro Asp ?ro Ser Thr Lys Lys Gin His Thr He Trp Pro Ser 
260 265 270 

Pro His Gin Gly Asn Ser Pro 



275 



Asp Lsu Glu Val Tyr Asn val He Arg 
280 285 

Lys Gin Ser Glu Ala Asp Leu Ala Glu Thr Arg Pro Asp Lou Lys Asn 
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29° 295 300 

lie ser Phe Arg Val Cys 5er Gly Glu Ala Thr Pro Asp Aso Met Ser 
305 310 315 ~ 320 

Cys Asp Tyr Asp Asn Met Ala Val Asn Pro Ser Glu Ser Gly Phe Val 
325 330 335 

Thr Leu Val Ser Val Glu Ser Gly Phe Val Thr Asn Asp lie Tyr Glu 
340 345 350 

Phe Ser Pro Asp Gin Met Gly Arg Ser Lys Glu Ser Gly Trp Val Glu 
355 360 365 

Asn Glu lie Tyr Gly Tyr 
370 



<210> 164 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 
<400> 164 

Met His Pro Gin Leu He Pro Ser Val He Ala Val Val Phe He Leu 
15 10 15 

Leu Leu Gly val Cys Phe He Ala Ser Cys Leu Val Thr His His Asn 
20 25 30 

Phe Ser Arg Cys Lys Arg Gly Thr Gly Val His Lys Leu Glu His His 
35 40 45 

Ala Lys Leu Lys Cys He Lys Glu Lys Ser Glu Leu Lys Ser Cys Xaa 
50 55 60 



<210> 165 
<211> 743 
^212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (743) 

<223> Xaa equals stop translation 
<400> 165 

Met Ala Val Arg Glu Leu Cys Phe Pro Arg Gin Arg Gin Val Leu Phe 
1 5 io 15 
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Leu Phe Leu Phe Trp Gly Val Ser Leu Ala Gly Ser Gly Phe Gly Arg 
20 25 30 

Tyr Ser Val Thr Glu Glu Thr Glu Lys Gly Ser Phe Val Val Asn Leu 
35 40 45 

Ala Lys Asp Leu Gly Leu Ala Glu Gly Glu Leu Ala Ala Arg Gly Thr 
50 55 60 

Arg Val Val Ser Asp Asp Asn Lys Gin Tyr Leu Leu Leu Asp Ser His 
65 70 75 60 

Thr Gly Asn Leu Leu Thr Asn Glu Lys Leu Asp Arg Glu Lys Leu Cys 
85 90 95 

" Gly Pro Lys Glu Pro Cys Met Leu Tyr Phe Gin He Leu Met Asp Asp 
100 $ 105 11Q 

Pro Phe Gin He Tyr Arg Ala Glu Leu Arg Val Arg Asp He Asn Asp 
115 120 125 

His Ala Pro Val Phe Gin Asp Lys Glu Thr Val Leu Lys He Ser Glu 
130 135 140 

Asn Thr Ala Glu Gly Thr Ala Phe Arg Leu Glu Arg Ala Gin Asp Pro 
145 150 155 160 

Asp Gly Gly Leu Asn Gly He Gin Asn Tyr Thr He Ser Pro Asn Ser 
165 170 175 

Phe Phe His He Asn He Ser Gly Gly Asp Glu Gly Met He Tyr Pro 
180 185 190 

Glu Leu Val Leu Asp Lys Ala Leu Asp Arg Glu Glu Gin Gly Glu Leu 
195 200 205 

Ser Leu Thr Leu Thr Ala Leu Asp Gly Gly Ser Pro Ser Arg Ser Gly 
210 215 220 

Thr Ser Thr Val Arg He Val Val Leu Asp Val Asn Asp Asn Ala Pro 
225 230 235 240 

Gin Phe Ala Gin Ala Leu Tyr Glu Thr Gin Ala Pro Glu Asn Ser Pro 
245 250 255 

He Gly Phe Leu He Val Lys Val Trp Ala Glu Asp Val Asp Ser Gly 
260 265 270 

Val Asn Ala Glu Val Ser Tyr Ser Phe Phe Asp Ala Ser Glu Asn He 
275 280 285 

Arg Thr Thr Phe Gin He Asn Pro Phe Ser Gly Glu He Phe Leu Arg 
29 ° 295 300 

Glu Leu Leu Asp Tyr Glu Leu Val Asn Ser Tyr Lys He Asn He Gin 
305 310 31S ' 320 

Ala Met Asp Gly Gly Gly Leu Ser Ala Arg Cys Arg Val Leu Val Glu 
325 330 335 
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Val Leu Asp Thr Asn Asp Asn Pro Pro Glu Leu lie Val Ser Ser Phe 
340 345 350 

Ser Asn Ser Val Ala Glu Asn Ser Pro Glu Thr Pro Leu Ala Val Phe 



355 



360 3 6 5 



Lys lie Asn Asp Arg Asp Ser Gly Glu Asn Gly Lys Met Val cys lyr 
370 375 380 

lie Gin Glu Asn Leu Pro Phe Leu Leu Lys Pro Ser Val Glu Asn Phe 
385 390 395 400 

Tyr He Leu He Thr Glu Gly Ala Leu Asp Arg Glu He Arg Ala Glu 
405 410 4 15 

Tyr Asn lie Thr Ha Thr , Val Thr Asp Leu Gly Thr Pro Arg Leu 



425 430 



Lys 



Thr Glu His Asn He Thr Val Leu Val Ser Asp Val Asn Asn Asn Ala 



435 440 445 

Pro Ala Phe Thr Gin Thr Ser Tyr Thr Leu Phe Val Arg Glu Asn Asn 
4S0 455 460 

Ser Pro Ala Leu His lie Gly Ser Val Ser Ala Thr Asp Arg Asp Ser 
470 475 480 

Gly Thr Asn Ala Gin Val Thr Tyr Ser Leu Leu Pro Pro Gin Asp Pro 
4 85 490 495 

His Leu Pro Leu Ala Ser Leu Val Ser He- Asn Ala Asp Asn Gly His 
500 505 510 

Leu Phe Ala Leu Arg Ser Leu Asp Tyr Glu Ala Leu Gin Ala Phe Glu 



520 



Phe Arg Val Gly Ala Thr Asp Arg Gly Ser Pro Ala Leu Asn Ser Glu 
530 535 540 

Ala Leu Gly Ala Arg Ala Gly Ala Gly Arg Gin Arg Gin Leu Ala Leu 
545 550 555 5S0 

Arg Ala Val Pro Ala Ala Glu Arg Leu Arg Ala Leu His Arg Ala Gly 
565 570 575 

Ala Pro Gly Gly Arg Ala Gly Leu Pro Gly Asp Gin Gly Gly Gly Gly 
580 



585 5 9 o 



Gly Arg Arg Leu Gly Pro Glu Arg Leu Ala Val Val Pro Ala Ala Gin 
595 €00 60S 

Gly His Gly Ala Arg Ala Val Arg Cys Val Gly Ala Glr. Trp Gly Glv 
610 615 620 

Ala His Arg Gin Ala Ala Glu Arg Ala Arg Arg Ser Gin Ala Gin Ala 

63 <> 635 6 40 

Gly Gly Ala cys Gin Gly Gin Trp Arg Ala Ser Ser Leu Gly His Arg 
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645 ( 650 655 

His Ala Ala Arg Ala Pro Gly Gly Arg Leu Leu Pro Ala Leu Pro Ala 
660 665 6 7o 

Ser Pro Gly Gly Gly Pro Gly Pro Gly Pro Gly Arg Leu Ala H s Ara 
675 . 680 685 

Leu Pro Gly Gly Gly Val Gly Leu Gly Val Phe Ala Leu Pro Pro Leu 
690 695 700 

Gly Ala Pro Val Arg Gly Gly Ala Ala Val Gin Glu Glu Gin Gly civ 
705 ™> 715 12 q 

Leu Gly Gly Ser Leu Leu Gly Ala Arg Gly Ser Phe Ser Arg Ala Ser 
725 730 735 

Gly Gly Arg Glu Gly Arg Xaa 
740 



<210> 166 
<211> 214 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (214) 

<223> Xaa equals stop translation 
<400> 166 

Met Asn Arg Met Glu Leu Leu Lys Leu Leu Leu Thr Cys Phe Ser Glu 
1 5 10 is 

Ala Ket Tyr Leu Pro Pro Ala Pro Glu Ser Gly Ser Thr Asn Pro Trn 
20 25 30 

Val Gin Phe Phe Cys Ser Thr Glu Asn Arg His Ala Leu Pro Leu Phe 
35 40 45 

Thr Ser Leu Leu Asn Thr Val Cys Ala Tyr Asp Pro Val Gly Tyr Gly 
50 55 60 

lie Pro Tyr Asn His Leu Leu Phe Ser Asp Tyr Arg Glu Pro Leu Val 



65 70 75 



80 



GlivGlu Ala Ala Gin Val Leu He Val Thr Leu Asp His Asp Ser Ala 
85 90 95 

Ser Ser Ala Ser Pro Thr Val Asp Gly Thr Thr Thr Gly Thr Ala Met 
100 105 no 

Asp Asp Ala Asp Pro Pro GLy Pro Glu Asn Leu Phe Val Asn Tyr Leu 
115 120 125 

Ser Arg He His Arg Glu Glu Asp Phe Gin Phe He Leu Lys G\ He 
130 135 140 
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120 

Ala Arg Leu Leu Ser Asn Pro Leu Leu Gin Thr Tyr Leu Pro Asn Ser 
145 150 155 -!60 

Thr Lys Lys Asp Pro Val Pro Pro Gly Ala Ala Ser Ser Leu Leu Glu 
165 170 175 

Ala Leu Arg Leu Gin Gin Glu lie Pro Leu Leu Arg Ala Glu Glu Gin 
180 las 190 

Arg Arg Pro Arg His Pro Cys Pro His Pro Leu Leu Pro Gin Arg Cys 
195 200 205 

Pro Gly Arg Ser Val Xaa 
210 



<210> 167 
<211> 213 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (213) 

<223> Xaa equals scop translation 
<400:> 167 

Met Pro Ser Leu Arg Phe Leu Ala Leu Ala Leu Leu Leu Ala lie Leu 
1 5 io is 

Pro Ala Leu Pro Asn Ala His Ala Ala Pro Gly lie Gly Gly Leu He 
20 25 30 

Gly Gly Gly Ser Gin Ala Ser Ala Lys Glu Glu Pro Gin Ser Asn Ala 
35 4 o 45 

Gin Pro Ser Ala Asp Glu Arg Lys Gin Arg Leu Leu Ser Gin Ala Glu 
50 55 60 

Glu Thr Arg Gin Arg Leu Thr Asp Leu Lys Ala Glu Leu Ala Gly Ala 
65 70 75 80 

Pro Lys Glu lie Ser Glu Ala Gin Arg Thr Leu Ser Lys Leu Val Ser 
85 90 95 

Glu Asp Asn Ser Asp Leu Pro Glu Arg Leu Ser Lys Leu Ser Val Pro 
100 105 no 

Val Leu Glu Gin Arg Leu Ala Ala Arg Val Asp Glu Leu Ala Leu Trp 
11 5 120 125 

Gin Gin Ala Leu Ser Ala Ala Asn Ser Met Leu He Ser Ala Gin Thr 
130 135 140 

Arg Pro Glu Arg Ala Gin Ala Asp lie Ser Lys Asn Gin Leu Arg He 
145 150 155 160 

Asp Glu lie. Asn Gly Leu Leu Lys Ser Gly Arg Glu Asn Asn Lys Pro 
165 170 175 
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leu Thr Asp Glu Arg Arg Ala Leu 
180 

Ala Gly Pro Ser lie Phe His Pro 

195 200 

Gin Phe Ala Leu Xaa 
210 



Leu Glu Ser Thr Ser Arg Ala Ala 
185 190 

Gly Gly Val Pro Gly Lys Cys Thr 
205 



<210> 168 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE * 
<222> (75) 

<223> Xaa equals stop trans lac ion 
<400> 168 

Met Phe Thr Scr Phe Gly Leu Ala Ser Pro Arg lie Leu Phe Cys Phe 
1 5 io is 

Cys Phe Phe Asp Leu Gly Phe lie Phe Phe Cys Val Leu Tyr Tyr He 
20 25 30 

Val Lys Gly He Leu Ala Glu Thr Leu Val Phe Gly Ala Arg Gly Glu 
3 5 40 45 

Gin Glu Cys Trp Ala Val Tyr Phe Arg Trp Arg Thr His Leu Gin Thr 
50 55 60 

Phe Gly Leu Phe Ser Phe Asn Cys Ser Val Xaa 
65 70 75 



<210> 169 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 169 

Met Phe Leu Cys Leu Phe Phe Phe Phe Phe Asn Ala Thr Gin Gly Asn 
IS io 15 

He Phe He Ser Phe Leu Ser Gly Leu Pro Gin Cys He Phe He Ser 
20 25 30 

Phe Glu Thr Lys Arg Phe Trp Lys Leu Phe Phe Cys Ser Phe Lys Xaa 
35 40 45 
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«:210> L70 

<211> 88 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (88) 

<223> Xaa equals stop translation 



<400> 170 

Met Gly Leu His Leu Arg Pro Tyr Arg Val Gly Leu Leu Pro Asp Gly 
15 10 15 

■f 

Leu Leu Phe Leu Leu Leu Leu Leu Met Leu Leu Ala Asp Pro Ala Leu 
20 25 30 

Pro Ala Gly Arg His Pro Pro Val Val Leu Val Pro Gly Asp Leu Gly 
35 40 45 



Asn Gin Leu Glu Ala Lys Leu Asp Lys Pro Thr Val Val His Tyr leu 

50 55 60 

Cys Ser Lys Lys Thr Glu Scr Tyr Phe Thr lie Trp Leu Asn Leu Glu 

65 70 75 80 



Leu Leu Leu Pro Val His His Xaa 
85 



<210> 171 
<211> 42 
<212> PRT 

<2i3> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 171 

Met Ala Cys Glu Thr His Gly Val Leu Val Pro Ala His Leu Ser Gly 
15 10 15 

Leu lie Thr Cys Leu Leu Ala Phe Trp Val Pro Ala Ser Cys lie Gin 
20 25 30 

Arg Cys Ser Gly Ser Pro Leu Pro Leu Xaa 
35 40 



<210> 172 

<211> 48 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 172 

Met Gin Cys Phe Leu Phe Ser He Phe Leu He Thr Gly Leu Ala Glu 
1 5 10 15 

Glu Phe Cys Glu Gin Leu Ser lie Ser Leu Ala Glu Glu Glu He Gin 
20 25 30 

Leu Ser Ser Thr Val Glu His Phe Cys Met Thr Ala Phe Ser Trp Xaa 
35 40 45 



<210> 173 
<211> 233 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (233) 

<223> Xaa equals stop translation 
<400> 173 

Met Ala Ala Leu Ala Ala Ala Ala Lys Lys Val Trp Ser Ala Arg Arg 
15 10 15 

Leu Leu Val Leu Leu Phe Thr Pro Leu Ala Leu Leu Pro Val Val Phe 
20 25 30 

Ala Leu Pro Pro Lys Glu Gly Arg Cy S Leu Phe Val He Leu Leu Met 
35 40 45 

Ala Val Tyr Trp Cys Thr Glu Ala Leu Pro Leu Ser Val Thr Ala Leu 
50 55 60 

Leu Pro He Val Leu Phe Pro Phe Met Gly He Leu Pro Ser Asn Lys 
65 70 7c 1 

u '5 B0 

Val Cys Pro Gin Tyr Phe r.eu Asp Thr Asn Phe Leu Phe Leu Ser Gly 
85 90 g5 

Leu lie Met Ala Ser Ala He Glu Glu Trp Asn Leu His Arc; Arg He 
100 105 no 

Ala Leu Lys He Leu Met Leu Val Gly Val Gin Pro Ala Arg Leu He 



115 



120 125 



Leu Gly Met Met Val Thr Thr Ser Phe Leu Ser Met Trp Leu Ser 



135 



Thr Ala Ser Thr Ala Met Met Leu Pro He Ala Asn Ala He Leu Lys 
145 150 155 160 
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Ser Leu Phe Gly Gin Lys Glu Val Arg Lys Asp Pro Ser Gin Glu Ser 
165 170 175 

Glu Glu Asn Thr Gly lie Glu Pro Asa Thr Phe Leu Ser Glu G u Ara 
180 185 190 

Leu Lys Leu Gin Ala Pro Leu Val lie Ar g Leu Gly Gin lie Thr Glu 
195 200 205 

Ser Gly Gin Trp Asn Met Ser Gly Asn Asp Val Cys Asn Phe Arg Val 
210 215 220 

Leu Ser Phe Leu Pro Gly Gly Met Xaa 
,225 230 

<210> 174 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 174 

hr lie Phe Gly Tyr Leu His Cys Val Lys Cys Tyr 

15 



Met Gly Thr lie Phe Gly Tyr Leu His Cys Val Lys Cys Tyr Val Leu 
1 5 io i 5 

Tyr Phe lie Phe lie Leu lie Thr Ala Val Tyr His Ser Phe Tyr Tyr 
20 25 30 

Pro His Tyr Arg Gly Lys Ala Leu lie Ser Gly Thr Xaa 
35 40 



45 



<210> 175 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals any of the naturally occurring L-amino acids 
^220> 

<221> SITE 
<222> (85) 

<223> Xaa equals stop translation 
<400> 175 

Met Val Trp Phe Leu Phe Leu Val Phe lie Phe Leu Lys Val Lys Gly 
1 5 io 



IS 



Asp Phe Phe Pro Pro Phe Leu He Cys Asn Leu Phe Cys He Trp Met 
20 25 30 
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He Thr Gly Val Ser His Arg Leu Gin Pro Gin He Leu Phe Ser Ara 
35 40 45 

His Lys His Asn Gin Glu He He Leu Gin Met Val Ser Phe Ser Cvs 
50 55 so 

Cys Val Phe Phe Pro Met lie Arg Glu Val Lys Ser Xaa Leu Gly Cys 
65 7 <> 75 80 

He Lys Met Ser Xaa 
85 



<210> 176 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 176 

Mat Trp Val Leu Leu Ser Cys Pro Leu Pro Pro Leu Cys Leu Pro Ala 
15 10 15 

Ser Ala Val Pro Gly Gin Cys Leu Gly Gly Gin Trp Ser Gly His Gin 
20 25 30 

Leu Arg Leu Arg Gly Arg Gly Trp His Cys Arg Cys His Cys Arg Ala 
35 40 45 

Trp. Ala Ala Asp Met Gly Arg Gly Leu His Ser Cys Gin Leu Leu Ser 
50 55 60 

Arg Xaa 
65 



<210> 177 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<22i> SITE 
*222> (55) 

<223> Xaa equals stop translation 
<400> 177 

Met Leu Leu Leu Cys He Leu Leu He Phe Cys Val Val Gly Leu Ser 
1 5 iQ . 15 

Val Val Gly Arg Arg Val Leu Lys Ser Thr Thr He He Val Tyr Leu 
20 25 30 

Ser lie Thr Pro Phe Ser Ser Phe Ser Ser He Ser His He Phe Gin 
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35 



40 



45 



^eu Leu He Gly Ala His Xaa 
50 55 



<210> 178 

<211> 83 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (4) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> ^ 
<221> SITE 
<222> (83) 

<223> Xaa equals stop translation 
<400> 178 

Met cys Val Xaa Leu Ser Phe Cys Pro Phe Leu Ser Ser Ala Leu Pro 
1 5 10 15 

Ala Ser His Thr Gin Phe Tyr Met Pro Arg Gly Ala Lys Phe Gly "hr 
20 25 30 

Phe Thr Leu Gin Ala Ser Val Ser Pro Leu Glu Glu Lys Thr His Ser 
35 40 45 

Phe Thr His Pro Gly He Gly Gly Lys Leu Leu Gly His Gin Asp Pro 
50 55 60 

Gly Ala pro Gly Pro Ser Trp Asn He Arq Ser Thr Trp Ser Thr Arg 
65 7 <> 75 bo 

Ser Leu Xaa 



<210> 179 
<211> 330 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (38) 

<223> xaa equals any of the naturally occurring L-amino acids 
<220> 

<22l> SITE 
<222> (247) 

<223> xaa equals any of the naturally occurring L-anino acids 
<400> 179 

Met ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 5 10 15 
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Gly Ala Ala Val lie Leu Ala Gin Leu Leu Arg Arg Cys Arg Gly Gly 
20 25 30 

Phe Leu Glu Pro Val Xaa Pro Pro Arg Pro Asp Arg Val Ala lie Val 
35 $0 45 

Thr Gly Gly Thr Asp Gly He Gly Tyr Ser Thr Ala Lys His Leu Ala 
50 55 60 

Arg Leu Gly Met His Val He He Ala Gly Asn Asn Asp Ser Lys Ala 
65 70 75 80 

Lys Gin Val Val Ser Lys He Lys Glu Glu Thr Leu Asn Asp Lys Val 
85 90 95 

Glu Phe Leu Tyr Cys Asp £eu Ala Ser Met Thr Ser He Arg Gin Phe 
100 105 HO 

Val Gin Lys Phe Lys Met Lys Lys He Pro Leu His Val Leu TIa * S n 
115 120 125 

Asn Ala Gly Val Met Met Val Pro Gin Arg Lys Thr Arg Asp Gly Phe 
130 135 140 

Glu Glu His Phe Gly Leu Asn Tyr Leu Gly His Phe Leu Lsu Thr Asn 
145 150 155 160 

Leu Leu Leu Asp Thr Leu Lys Glu Ser Gly Ser Pro Gly His Ser Ala 
165 170 175 

Arg Val Val Thr Val Ser Ser Ala Thr His Tyr Val Ala Glu Leu Asn 
180 185 190 

Met Asp Asp Leu Gin Ser Ser Ala Cys Tyr Ser Pro His Ala Ala Tyr 
195 200 205 

Ala Gin Ser Lys Leu Ala Leu Val Leu Phe Thr Tyr His Leu Gin A^g 
2i0 215 220 

Leu Leu Ala Ala Glu Gly Ser His Val Thr Ala Asn Val Val Asp Pro 
225 230 235 240 

Gly val val Asn Thr Asp Xaa Tyr Lys His Val Phe Trp Ala Thr Arg 
245 250 255 

Lou Ala Lys Lys Leu Leu Gly Trp Leu Leu Phe Lys Thr Pro Asp Glu 
26 0 265 270 

Gly Ala Trp Thr Ser He Tyr Ala Ala Val Thr Pro Glu Leu Glu Gly 
275 280 285 

Val Gly Gly Arg Tyr Leu Tyr Asn Glu Lys Glu Thr Lys Ser Leu His 
290 295 300 

Val Thr Tyr Asn Gin Lys Leu Cln Gin Gin Leu Trp Ser Lys Ser Cys 
305 310 315 320 

Glu Met Thr Gly Val Leu Asp Val Thr Leu 
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325 330 



<210> 1B0 
<:211> 41 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 180 

Met He Ala Cys Gin Tyr He Ser Leu Ala lie Met Leu Ala Phe Val 
1 5 10 15 

Arg Trp Ala Ala Phe Leu Leu Phe Pro Phe Leu Cys Gly Asp Asn Gly 
20 25 30 

Gly Asn He Gin Gin Lys Tyr Val Xaa 
35 40 



<210> 181 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (52) 

<223> Xaa equals stop translation 
<400> 181 

Met Ala Asn Ala Met Ala Tyr Leu Ser He Phe Leu Cys Gly Ala Ser 
1 5 io 15 

Ser Ser Pro Cys Asp Cys Ala Leu Leu Val Pro Val Ser Leu Phe Arq 
20 25 30 

Gly Arg Lys Val Ala Asn Phe Lys Asn Gin Asn Ser Asp Val Thjr Se- 
35 40 45 

Gly Asn Ala xaa 
50 



<210> 182 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 
<400> 182 



WO 00/06698 



129 



PCTAJS99/17130 



Met Gin Gin lie Cys Ser Cys Leu Gly Ala Phe Ala Leu Leu Phe Phe 
1 5 io 15 ■ 

Trp Pro Gly His Phe Thr Ser Thr Phe Ser He Phe Tyr Asp Phe Leu 
20 25 30 

Pro He Phe Gly Ser Leu Phe Lys Cys His Pro Ser Lys Arg Pro Ser 
35 40 45 

Lys Leu Pro Tyr Leu Lys Xaa 
50 55 



<210> 183 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 183 

Met Arg Leu Leu Leu Glu Trp Arg Val Tyr Leu Arg Leu Thr Cys Ala 
15 10 is 

Thr Lys Asp Gly Met Ala Arg Glu Cys Pro Thr Thr Trp Leu Ser Pro 
20 25 30 

Pro Ala Lys Pro Asp Phe Ala Gin Arg His Ser Val Lys Pro Thr Ala 
35 40 45 

Leu Gin Gly Gly Arg Trp Ser Arg Leu Gly Ala Ser Pro Xaa 
50 55 60 



<210> 184 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (148) 

<223> Xaa equals stop translation 
<400> 184 

Met Leu Gly Leu Pro Trp Lys Gly Gly Leu Ser Trp Ala Leu Leu Leu 
1 5 10 1S 

Leu Leu Leu Gly Ser Gin He Leu Leu lie Tyr Ala Trp His Phe His 
20 25 30 

Glu Gin Arg Asp Cys Asp Glu His Asn Val Met Ala Arg Tyr Leu Pro 
35 40 45 

Ala Thr Val Glu Phe Ala Val His Thr Phe Asn Gin Gin Ser Lys Asp 
5 0 55 60 
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Tyr Tyr Ala Tyr Arg Leu Gly His lie Leu Asn Ser Trp Lvs Glu Gin 
6S 70 75 " 80 

Val Glu Ser Lys Thr Vai Phe Ser Met Glu Leu Leu Leu Gly Arg Thr 
85 90 95 

Arg Cys Gly Lys Phe Glu Asp Asp He Asp Asn Cys His Phe Gin Glu 
100 105 no 

Ser Thr Glu Leu Asn Asn Thr Phe Thr Cys Phe Phe Thr Tie Ser Thr 
US 120 125 

Arg Pro Trp Met Thr Gin Phe Ser Leu Leu Asn Lys Thr Cys Leu Glu 
130 135 140 



Gly Phe His Xaa 
145 



<210> 185 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (146) 

<223> Xaa equals any of the naturally occurring L-aiuino acids 
<220> 

<221> SITE 
<222> (151) 

<223> Xaa equals any of the naturally occurring L- ami no acids 
<220> 

<221> SITE 
<222* (161) 

<22 3> Xaa equals stop translation 
<400> 185 

Met Arg Leu Leu Cys Gly Leu Trp Leu Trp Leu Ser Leu Leu Lys Val 
1 5 io is 

Leu Gin Ala Gin Thr Pro Thr Pro Leu Pro Leu Pro Pro Pro Met Gin 
20 25 30 

Ser Phe Gin Gly Asn Gin Phe Gin Gly Glu Trp Phe Val Leu Gly Leu 
35 40 45 

Ala Gly Asn Ser Phe Arg Pro Glu His Arg Ala Leu Leu Asn Ala Phe 
50 55 60 

Thr Ala Thr Phe Glu Leu Ser Asp Asp Gly Arg Phe Glu Val Trp Asn 
65 70 75 80 

Ala Met Thr Arg Gly Gin His Cys Asp Thr Trp Ser Tyr Val Leu T ie 
85 90 95 



WO 00/06698 



131 



PCT/US99/17130 



Pro Ala Ala Gin Pro Giy Gin Phe Thr Val Asp His Gly Val Gly Arg 
100 105 110 

£er Trp Leu Leu Pro Pro Gly Thr Leu Asp Gla Phe lie Cys Leu Gly 
115 120 125 

Arg Ala Gin Gly Leu Ser Asp Asp Asn lie Val Phe Pro Asp Val Thr 
130 135 140 

Gly Xaa Ala Leu Asp Leu Xaa Ser Leu Pro Trp Val Ala Ala Pro Ala 
145 150 155 160 



<210> 186 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (122) 

<223> Xaa equals stop translation 
<400> 186 

Met Met Leu Pro Gin Trp Leu Leu Leu Leu Phe Leu Leu Phe Phe Phe 
1 5 10 15 

Leu Phe Leu Leu Thr Arg Gly Ser Leu Ser Pro Thr Lys Tyr Asn Leu 
20 25 30 

Leu Glu Leu Lys Glu Ser Cys lie Arg Asn Gin Asp Cys Glu Thr Gly 
35 40 45 

Cys Cys Gin Arg Ala Pro Asp Asn Cvs Glu Ser His Cys Ala Glu Lys 
50 55 60 

Gly Ser Glu Gly Ser Leu Cys Gin Thr Gin Val Phe Phe Gly Gin Tyr 
6 5 70 75 80 

Arg Ala Cys Pro Cys Leu Arg Asn Leu Thr Cys lie Tyr Ser Lys Asn 
85 90 95 

Glu Lys Trp Leu Ser lie Ala Tyr Gly Arg Cys Gin Lys lie Gly Arg 
100 105 HO 

Gin Lys Leu Ala Lys Lys Met Phe Phe Xaa 
115 120 



<210> 187 
<211> 163 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
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<222> 1163) 

<223> Xaa equals scop translation 
<400> 187 

Met Thr Ser Asn Phe Pro Phe Cys Thr Leu lie Leu Gly lie Ala Gin 
1 5 io 15 

Ala Gin Ala Cys Pro Gly Cys Pro Gly Asp Trp Pro Gly Leu Gly Ser 
20 25 30 

Gly Val Gly Glu Gly Leu His His lie Arg Thr Cys Arg Thr Pro He 
35 40 45 

Pro Cys Ser Pro Pro Ala Pro Ala Ala Ala Cys Leu Gly Ser Gly His 
50 55 €0 

Ala Arg Leu Pro Cys Val Leu Arg Leu Trp Pro Val Pro Ala Asn Leu 
65 70 75 80 

Ser Ser Pro Phe Arg Leu Glu Ala Leu His Cys Ser Phe Trp Ser Ser 
B5 go 95 

Pro Leu Leu Pro Ala Pro His Leu Ala Phe Phe Gly Phe Arg Asp Leu 
100 105 HO 

Leu Thr Asp Phe Leu Leu Ala Ala Cys Leu Leu Thr Phe Gin Lys Thr 
115 120 125 

Pro Leu Glu Leu Pro Met Ala Val Val His Leu Leu Val Ala Thr Pro 
130 135 140 

Cys Tyr Gin Met Leu Asp Asn Leu ?ro Leu Pro Ser Ala Ala Ala Asn 
145 150 155 160 

Trp Cys Xaa 



<210> 188 

<211> 51 

<212> PRT 

<213> Homo sapiens 

•c220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 188 

Met Pro Gly He Leu Ala Gly He Pro Val Lys Asp Leu Cys Leu Ser 
15 io 15 

Leu Leu Gin Gly phe Arg Leu Leu Leu Leu Cys Val Cvs Pro Gly Trp 
20 25 " 30 

Leu Ser Gly Trp Met Gly Gly Gin Lys Gly Ser Pro Arg He Val Asp 
35 4C 45 



lie Gly Xaa 
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<210> 189 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 189 

eu Tyr Leu Gly Val Phe Phe 

10 15 



Met Tyr Leu Tyr Leu Gly Val Phe Phe His Leu He Tyr Pro Gly Ala 
1 5 10 15 

Leu Ser lie Thr Thr Leu Gly Lys His Ser His Pro Phe Phe Thr Ala 
20 25 30 

Glu Gin Asn Ser Thr Val Trp Met Glu His Thr Leu Phe His Gin Ser 
35 40 45 

Pro val Ala Ser His Leu Val Cys Phe Gin Ser Phe Ala Phe Ser Glu 
50 55 60 



Xaa 
65 



<210> 190 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (47) 

<223> Xaa equals stop translation 
<400> 190 

Met Thr Leu Ser Lau Gin Leu Ala Glu Leu Val His Phe Val Cys Ala 
1 5 io i5 

Phe Gin Ser Gin Trp Thr Gly Val Tyr Pro Met Met Pro Pro Leu Lys 
20 25 30 

Pro Thr Glu Pro Leu Cys Phe Ala Cys Val Pro Cys Arg Val Xaa 
35 40 45 



<210> 191 
<211> 144 
<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (144) 
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<223> Xaa equals stop translation 
<400> 191 

Met Ser Pro Phe His Leu Leu Gly Leu Lys Val Phe Leu Thr 'i*rp Ala 
15 1C 15 

Leu Thr Leu Ala Gin lie Cys Leu Tyr Phe Phe Glu Val Gin Pro Leu 
20 25 30 

Gly Leu Leu Ala Leu Asn Phe Phe Cys Thr Ala Thr Ala Gly Leu Lys 
35 40 45 

Glu Leu Cys Met His Pro Pro Ser Leu Ala Phe Thr Pro Glu Phe His 
50 55 60 

Thr Ser Leu Ser Pro Leu Ala He Pro Ser Phe Cys Gly Thr Ser Val 
65 70 75 e0 

Ser Leu Ser Asn Ser His Thr lie Pro Leu Ser Leu Tyr Leu Pro Phe 
85 90 95 

Pro Ser Lys Ser Arg Met Pro Asp Thr Leu His Leu Leu Val His Ser 
100 105 no 

Leu Pro Leu Val His Ser Gin Val Leu Pro Val Lys Asp Val Thr lie 
115 120 125 

Glu Trp Pro Leu Cys Gin Arg Cys Leu Gly Ser Thr Cys His Gin Xaa 
130 135 140 



<210> 192 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (76) 

<223> Xaa equals any of the naturally occurring L- amino acids 
<220> 

<221> SITE 
<222> (81) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 192 

Mat Phe Cys Phe Ser Ser He Phe Cys Ser His Glu His Thr His Leu 
1 5 10 15 

Pro Gly Thr Phe Trp Leu Phe Leu Phe Leu Phe Leu He Leu Pro Pro 
20 25 30 



Ser Cys Pro Cys Phe Leu Pro Phe Ser Leu Ala lie Glu Thr Val Arg 
35 40 45 
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Trp Pro Cys Trp His His Pro Thr Ser Phe Glu Leu Cys Tyr Pro Glv 
50 55 60 

Thr Ser lie Tyr Tyr Ala Ser Arg Gly Gly Pro Jiaa Pro Asn Ser Glu 
65 ™ 75 80 

Xaa 



<210> 193 
<211> 45 
<212> PRT 

<213> Homo sapiens 
: <220> 

<221> SITE r 
<222> (45) 

<223> Xaa equals stop translation 
<400> 193 



Met Thr Tyr Leu Pho Cys Ser Ser He Ser Leu Leu Leu Leu Lys 
1 5 1 n 



Val 

15 



His Ser Ser Gly His Gin Asp He Arg Lys Ala Lys Ser Lys Val Pro 
20 25 30 

Arg Leu Leu He He Gin Cys Pro Gin Gin Arg Glu Xaa 
35 40 45 



<210> 194 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 194 

Mat Pro Thr He Trp Val Lys Leu Cys Leu Leu Gin Val Cys His Gly 
1 5 io 15 

Leu Phe Pro Leu Leu Lys His Trp ser Gin Pro Met Pro Leu Cys Val 
20 25 30 

Thr Leu Ala Pro Val Ser Tyr Trp Leu Xaa 
35 40 



<210> 195 
<211> 260 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
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<222> (260) 

<223> Xaa equals stop translation 
<400> 19S 

Met Gly Thr Ala Ala Leu Gly Pro Val Trp Ala Ala Leu Leu Leu Phe 
1 5 10 15 

Leu Leu Met Cys Glu lie Pro Met Val Glu Leu Thr Phe Asp Arg Ala 
20 25 30 

Val Ala Ser Asp Cys Gin Arg Cys Cys Asp Ser Glu Asp Pro Leu Asp 
35 40 45 

Pro Ala His Val Ser Ser Ala Ser Ser Ser Gly Arg Pro His Ala Leu 
50 55 60 

Pro Glu He Arg Pro Tyr Ue Asn He Thr He Leu Lys Gly Asp Lys 
65 70 75 ao 

Gly Asp Pro Gly Pro Met Gly Leu Pro Gly Tyr Met Gly Arg Glu Gly 
85 90 95 

Pro Gin Gly Glu Pro Gly Pro Gin Gly Ser Lys Gly Asp Lys Gly Glu 
100 105 no 

Met Gly Ser Pro Gly Ala Pro Cys Gin Lys Arg phe Phe Ala Phe Ser 
115 120 125 

Val Gly Arg Lys Thr Ala Leu His Ser Gly Glu Asp Phe Gin Thr Leu 
130 135 140 

Leu Phe Glu Arg Val Phe Val Asn Leu Asp Gly Cys Phe Asp Met Ala 
145 ISO 155 i 6 o 

Thr Gly Gin Phe Ala Ala Pro Leu Arg Gly He Tyr Phe Phe Ser Leu 
165 170 175 

Asn Val His Ser Trp Asn Tyr Lys Glu Thr Tyr Val His He Met His 
180 185 190 

Asn Gin Lys Glu Ala Val He Leu Tyr Ala Gin Pro Ser Glu Arg Ser 
195 200 205 

He Met Gin Ser Gin Ser Val Met Leu Asp Leu Ala Tyr Gly Asp Arg 
21° 215 220 

Val Trp Val Arg Leu Phe Lys Arg Gin Arg Glu Asn Ala He Tyr Ser 
225 230 235 240 

Asn Asp Phe Asp Thr tyr He Thr Phe Ser Gly His Leu He Lys Ala 
245 250 255 

Glu Asp Asp Xaa 
260 



<210> 196 
<211> 117 
<212> PRT 
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<213> Homo sapiens 
<400> 196 

Met Leu Gly His Cys Cys Tyr Phe Trp Gin Val Trp Pro Ala Ser Glu 
15 10 15 

Ala Leu Ala Ala Gly Pro Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser 
20 25 30 

Trp Lys Gin His He Gly Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu 
35 40 45 

Pro Thr Thr Thr Leu Thr Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly 
50 55 60 

* Lys Asn Pro Ala Ala Gly Arg Ser Leu Glu Gly Ala Leu Pro Ala Gly 
65 ™ ; 75 80 

Val Trp Pro Cys Phe Ala Gin Ser Pro Cys Thr Gly Gly Gin Gin Thr 
85 90 95 

Pro Ser Ser Thr Gly Leu Arg Ser Cys Leu Val Arg Ser Pro Ala Thr 
100 105 HO 

Trp Trp Arg Thr Pro 
115 



<210> 197 
<211> 698 
<212> PRT 

<213> Homo sapiens 
<400> 197 

Met Leu Pro Ala Arg Leu Pro Phe Arg Leu Leu Ser Leu Phe Leu Arg 
1 5 io is 

Gly Ser Ala Pro Thr Ala Ala Arg His Gly Leu Arg Glu Pro Leu i,eu 
20 25 30 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu 
35 40 45 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Glu Gly Phe Gly Glu 
50 55 60 

Gly Gly Asp Met Gin Glu Arg Phe Leu Phe Pro Glu Tyr He Leu Aso 
65 70 75 8 b 

Pro Glu Pro Gin Pro Thr Arg Glu Lys Gin Leu Gin Glu Leu Gin Gin 
85 90 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Gin Arg Arg Glu Glu Arg 
100 105 no 

Arg Gin Gin Asn Leu Arg Ala Arg Ser Arg Glu His Pro Val Val Gly 
US 120 125 

His Pro Asp Pro Ala Leu Pro Pro Scr Gly Val Asn Cys Ser Gly Cys 
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130 135 



140 



Gly Ala Glu Leu His Cys Glr. Asp Ala Gly Val Pro Gly Tyr Leu Pro 
145 150 155 i 60 

Arg Glu Lys Phe Leu Arg Thr Ala Glu Ala Asp Gly Gly Leu Ala Arg 
165 17C 175 

Thr Val Cys Gin Arg Cys Trp Leu Leu Ser His His Arg Arg Ala Leu 
180 185 190 

Arg Leu Gin Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Ala 
195 200 205 

Leu Arg Arg Pro Gly Pro Ser Leu Val Leu Tyr Met Val Asp Leu Leu 
210 215 220 

Asp Leu Pro Asp Ala Leu Leu Pro Asp Leu Pro Ala Leu Val G 1 y Pro 
225 230 235 " 240 

Lys Gin Leu lie Val Leu Gly Asn Lys Val Aso Leu Leu Pro Gin Asd 
245 250 255 

Ala Pro Gly Tyr Arg Gin Arg Leu Arg Glu Arg Leu Trp Glu Asp Cys 
260 265 270 

Ala Arg Ala Gly Leu Leu Leu Ala Pro Gly His Gin Gly Pro Gin Ara 
275 280 285 

Pro Val Lys Asp Glu Pro Gin Asp Gly Glu Asn Pro Asn Pro Pro Asn 
290 295 300 

Trp Ser Arg Thr Val Val Arg Asp Val Arg Leu He Ser Ala Lys Thr 
305 310 315 320 

Gly Tyr Gly Val Glu Glu Leu lie Ser Ala Leu Gin Arg Ser Trp Arg 
325 330 335 

Tyr Arg Gly Asp Val Tyr Leu Val Gly Ala Thr Asn Ala Gly Lys Ser 
340 345 350 

Thr Leu Phe Asn Thr Leu Leu Glu Scr Asp Tyr Cys Thr Ala Lys Gly 
35 5 360 365 

Ser Asp Ala He Asp Arg Ala Thr He Ser Pro Trp Pro Gly Thr Thr 
370 375 380 

Leu Asn Leu Leu Lys Phe Pro He Cys Asn Pro Thr Pro Tyr Arg Met 
385 390 395 400 

Phe Lys Arg His Gin Arg Leu Lys Lys Asp Ser Thr Gin Ala Glu Glu 
405 410 415 

Asp Leu Ser Glu Gin Glu Gin Asn Gin Leu Asn Val Leu Lys Lys His 
420 425 430 

Gly Tyr Val Val Gly Arg Val Gly Arg Thr Phe Leu Tyr Ser Glu Glu 
435 440 445 
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Gin Lys Asp Asn lie Pro Phe Glu Phe Asp Ala Asp Ser Leu Ala Phe 
450 455 460 

Asp Met Glu Asn Asp Pro Val Met Gly Thr His Lys Ser Thr Lys Gin 
465 470 475 4 80 

Val Glu Leu Thr Ala Gin Asp Val Lys Asp Ala His Trp Phe Tyr Asp 
4 85 490 495 

Thr Pro Gly He Thr Lys Glu Asn Cys lie Leu Asn Leu Leu Thr Glu 
500 505 sio 

Lys Glu Val Asn He Val Leu Pro Thr Gin Ser He Val Pro Arg Thr 
515 520 525 

Phe Val Leu Lys Pro Gly Met Val Leu Phe Leu Gly Ala He Gly Ara 
530 ^535 540 

lie Asp Phe Lou Gin Gly Asn Gin Ser Ala Trp Phe Thr Val Val Ala 
545 55 ° 555 560 

Ser Asn He Leu Pro Val His lie Thr Ser Leu Asp Arg Ala Asp Ala 
565 57o 575 

Leu Tyr Gin Lys His Ala Gly His Thr Leu Leu Gin He Pro Met Glv 
580 585 590 

Gly Lys Glu Arg Met Ala Gly Phe Pro Pro Leu Val Ala Glu Asp He 
595 600 605 

Met Leu Lys Glu Gly Leu Gly Ala Ser Glu Ala Val Ala Asp He Lys 
610 515 620 

Phe Ser Ser Ala Gly Trp Val Ser Val Thr Pro Asn Phe Lvs Asp Arc 
625 630 635 ' 6 40 

Leu His Leu Arg Gly Tyr Thr Pro Glu Gly Thr Val Leu Thr Val Arg 
645 650 655 

Pro Pro Leu Leu Pro Tyr He Val Asn He Lys Gly Gin Arg He Lys 
660 665 670 

Lys Ser Val Ala Tyr Lys Thr Lys Lys Pro Pro Ser Leu Met Ty- Asn 
675 680 6S5 

Val Arg Lys Lys Lys Gly r.ys He Asn Val 
690 695 



<210> 198 
<211> 340 
<212> PRT 

<213> Homo sapiens 
<4O0> 198 

Met Asr. Met Thr Gin Ala Arg Val Leu Val Ala Ala Val Val Gly Leu 
5 10 15 

Val Ala Val Leu Leu Tyr Ala Ser He His Lys He Glu Glu Gly His 
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20 25 30 

Leu Ala Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr ser Pro Ser Gly 
3 5 40 45 

Pro Gly Tyr His He Met Leu Pro Phe He Thr Thr Phe Arg Ser Val 
50 55 60 

Gin Thr Thx Leu Gin Thr Asp Glu Val Lys Asn Val Pro Cys Gly Thr 
65 70 75 80 

Ser Gly Gly Val Met He Tyr He Asp Arg He Glu Val Val Asn Met 
85 go 95 

Leu Ala Pro Tyr Ala Val Phe Asp lie' Val Arg Asn Tyr Thr Ala Asp 
100 105 no 

Tyr Asp Lys Thx Leu He Phe Asn Lys He His His Glu Leu Asn Gin 
115 120 125 

Phe Cys Ser Ala His Thr Leu Gin Glu Val Tyr He Glu Leu Phe Asp 
130 135 140 

Gin He Asp Glu Asn Leu Lys Gin Ala Leu Gin Lys Asp Leu Asn Leu 
145 150 155 160 

Met Ala Pro Gly Lou Thr He Gin Ala Val Axg Val Thr Lys Pro Lys 
165 170 175 

He Pro Glu Ala He Arg Arg Asn Phe Glu Leu Met Glu Ala Glu Lys 
160 185 190 

Thr Lys Leu Leu He Ala Ala Gin Lys Gin Lys Val Val Glu Lys Glu 

200 205 

Ala Glu Thr Glu Arg Lys Lys Ala Val He Glu Ala Glu Lys He Ala 
210 215 220 

Gin Val Ala Lys He Arg Phe Gin Gin Lys Val Met Glu Lys Glu Thr 
225 230 235 240 

Glu Lys Arg He Ser Glu He Glu Asp Ala Ala Phe Leu Ala Arg Glu 
245 250 2S5 

Lys Ala Lys Ala Asp Ala Glu Tyr Tyr Ala Ala His Lys Tyr Ala Thr 
260 265 270 

Scr Asn Lys His Lys Leu Thr Pro Glu Tyr Leu Glu Leu Lys Lys Tyr 
275 280 285 

Gin Ala He Ala Ser Asn Ser Lys He Tyr Phe Gly Ser Asn He Pro 
290 295 300 

Asn Me; Phe Val Asp Ser Ser Cys Ala Leu Lys Tyr Ser Asp He Arg 
305 310 315 320 

Thr Gly Arg Glu Ser Ser Leu Pro Ser Zys Glu Ala Leu Glu Pro Ser 
325 330 335 
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Gly Glu Asn Val He Gin Asn Lys Glu Ser Thr Gly 
340 345 



<210> 199 
<211> 401 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (307) 

<223> Xaa equals any o£ the naturally occurring L-amino acids 
<400> 199 

Met Met Gly Leu Gly Asn Gly Arg Arg Ser Met Lys Ser Pro Pro Leu 
1 5 ; 10 15 

val Leu Ala Ala Leu Val Ala Cys He He Val Leu Gly Phe Asn Tyr 
20 25 30 

Trp He Ala Ser Ser Arg Ser Val Asp Leu Gin Thr Arg He Met Glu 
35 40 45 

Leu Glu Gly Arg Val Arg Arg Arg Ala Ala Glu Arg Gly Ala Val Glu 
50 55 60 

Leu Lys Lys Asn Glu Phe Gin Gly Glu Leu Glu Lys Gin Arg Glu Gin 
65 70 75 so 

Leu Asp Lys He Gin Ser Ser His Asn Phe Gin Lgu Glu Ser Val Asn 
85 90 g 5 

Lys Leu Tyr Gin Asp Glu Lys Ala Val Leu Val Asn Asn He Thr Thr 
100 105 no 

Gly Glu Arg Leu He Arg Val Leu Gin Asp Gin Leu Lys Thr Leu Gin 
13 -5 120 125 

Arg Asn Tyr Gly Arg Leu Gin Gin Asp Val Leu Gin Phe Gin Lvs Asn 
130 135 140 

Gin Thr Asn Leu Glu Arg Lys Phe Ser Tyr asd Lgu Ser Gin Cys He 
145 150 155 160 

Asn Gin Met Lys Glu Val Lys Glu Gin Cys Glu Clu Arg He Glu Glu 
165 170 175 

Val Thr Lys Lys Gly Asn Glu Ala Val Ala Ser Arg Asp Leu Ser Glu 
180 185 19c 

Asn Aan Asp Gin Arg Gin Gin Leu Gin Ala Leu Ser Glu Pro Gin Pro 
195 200 205 

Arg Leu Gin Ala Ala Gly :eu Pro His Thr Glu Val Pro Gin Gly Lys 
210 215 220 

Gly Asn Val Leu Gly Asn Ser Lys Ser Gin Thr Pro Ala Pro Ser Ser 
225 230 235 240 
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Glu Val Val Leu Asp Ser Lys Arg Gin Val GLu Lys Glu Glu Thr Asn 
245 250 255 

Glu lie Gin Val Val Asn Glu Glu Pro Gin Arg Asp Arg Leu Pro Glr 
260 265 270 

Glu Pro Gly Arg Glu Gin Val Val Glu Asp Arg Pro val Gly Gly Arg 
27 5 280 2B5 

Gly Phe Gly Gly Ala Gly G lu Leu Gly Gin Thr Pro Gin Val Gin Ala 
290 295 300 

Ala Leu Xaa Val Ser Gin Glu Asn Pro. Glu Met Glu Gly Pro Glu Arg 
305 310 315 320 

Asp Gin Leu Val lie Prp Asp Gly Gin Glu Glu Glu Gin Glu Ala Ala 
325 330 335 

Gly Glu Gly Arg Asn Gin Gin Lys Leu Arg Gly Gin Asp Asp Tyr Asn 
340 345 3 50 

Met Asp Glu Asn Glu Ala Glu Ser Glu Thr Asp Lys Gin Ala Ala Leu 
355 360 365 

Ala Gly Asn Asp Arg Asn lie Asp Val Phe Asn Val Glu Asp Gin Lys 
3™ 37S 380 

Arg Asp Thr lie Asn Leu Leu Asp Gin Arg Glu Lys Ara Asn His Thr 
385 390 395 " 400 



<210> 200 
<211> 324 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (3) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 200 

Ala Lys Val Asp Cys Thr Ala His 
10 15 

Val Arg Gly Tyr Pro Thr Leu Lys 
25 30 

Val Lys Tyr Gin Gly Pro Arg Asp 
45 

Leu Gin Thr Leu Asn Glu Glu Pro 
60 

Pro Pro Ser Ala Pro Glu Leu Lys 



Met Glu Xaa Ala 
1 

Ser Asp Val Cys 
20 

Leu Phe Lys Pro 
35 

Phe Gin Thr Leu 
50 

Val Thr Pro Glu 



Lys Val Tyr Val 
5 

Ser Ala Gin Gly 

Gly Gin Glu Ala 
40 

Glu Asn Trp Met 
55 

Pro Glu Val Glu 
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65 ™ 75 80 

Gin Gly Leu Tyr G lu Leu Ser Ala Sex Asn Phe Glu Leu His Val Ala 
85 go 95 

Gin Gly Asp His Phe Tie Lys Phe Phe Ala Pro Trp Cys Gly His Cys 
100 10 5 U0 

Lys Ala Leu Ala Pro Thr Trp Glu Gin Leu Ala Leu Gly Leu Glu His 
US 120 125 

Ser Glu Thr Val Lys He Gly Lys Val Asp Cys Thr Gin His Tyr Glu 
130 13S 140 

Leu Cys Ser Gly Asn Gin Val Arg Gly Tyr Pro Thr Leu Leu T^p Phe 
145 150 155 i60 

Arg Asp Gly Lys Lys Val Asp Gin Tyr Lys Gly Lys Arg Asp Leu Glu 
165 170 175 

Ser Leu Arg Glu Tyr Val Glu Ser Gin Leu Gin Arg Thr Glu Thr Glv 
!80 IBS 190 

Ala Thr Glu Thr Val Thr Pro Ser Glu Ala Pro Val Leu Ala Ala Glu 
195 200 205 

Pro Glu Ala Asp Lys Gly Thr Val Leu Ala Leu Thr Glu Asn Asn Fhe 
210 215 220 

Asp Asp Thr He Ala Glu Gly He Thr Phe He Lys Phe Tyr Ala Pro 
225 230 235 240 

Trp Cys Gly His Cys Lys Thr Leu Ala Pro Thr Trp Glu Glu Leu Ser 
245 250 255 

Lys Lys Glu Phe Pro Gly Leu Ala Gly Val Lys He Ala Glu Val Asp 
260 265 270 

Cys Thr Ala Glu Arg Asn He Cys Ser Lys Tyr Ser Val Arg Gly Tvr 
275 280 285 

Pro Thr Leu Leu Leu Phe Arg Gly Gly Lys Lys Val Ser Glu His Ser 
290 295 300 

Gly Gly Arg Asp Leu Asp Ser Leu His Arg Phe Val Leu Ser Gin Ala 
305 310 315 

Lys Asp. Glu Leu 



<210> 201 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 201 

Met Ala Leu Phe ser Cys Leu Leu Leu Leu Lys Gin Ser Asp Gly Ala 
1 5 io 



15 
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Ser Pro Val Leu Arg Ala Leu Ala Ala Ser Cys Leu Ala Ser Pro Ala 
20 25 30 

Gly Cys Cys Gly Thr Arg Lys Ala Leu Asn Gly Asn Val G^y Glu Lys 
35 40 45 

Val Gly Pha Thr Phe Met Ser Phe Gin Gly Cys Asp Pro Ser Ser Pro 
50 55 60 

Gly Cys Leu Cys Cys Ser Leu Leu Pro Ser Asn Ser Gin Leu Val Phe 
65 7 ° 75 80 

He ser Phe Leu Val Leu Ser Gly Leu Ala 
85 go 



<210> 202 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 202 

Met Arg Pro Gin Gly Pro Ala Ala Ser Pro Gin Arg Leu Arg Gly Leu 
1 5 ' 10 15 

Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala Ser Glu 
20 25 30 

lie Pro Lys Gly Lys Gin Lys Ala Gin Leu Arg Gin Arg Glu Val Val 
35 40 45 

Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala Gly Val Pro Glv 
50 55 60 

Arg Asp Gly Ser Pro Gly Ala Asn Gly He Pro Gly Thr Pro Gly He 
65 ™ 75 go 

Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys Gly Glu Cys Leu Arg Glu 
85 go 95 

Ser Phe Glu Glu Ser Trp Thr Pro Asn Tyr Lys Gin Cys Ser Trp Ser 
100 105 no 

Ser Leu Asn Tyr Gly He Asp Leu Gly Lys lie Ala Glu Cys Thr Phe 
US 120 125 

Thr Lys Ket Arg ser Asn Ser Ala Leu Arg val Leu Phe Ser Gly Ser 
130 135 140 

Leu Arg Leu Lys Cys Arg Asn Ala Cys Cys Glu Arg Trp Tyr Phe Thr 
145 "0 155 160 

Phe Asn Gly Ala Glu Cys Ser Gly Pro Leu Pro He Glu Ala He He 
165 170 175 

Tyr Leu Asp Gin Gly Ser Pro Glu Met Asn Ser Thr He Asn He His 
180 185 190 
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Arg Thr Ser Ser Val Glu Gly Leu Cys Glu Gly He Gly Ala Gly Leu 

195 200 205 

Val Asp Val Ala He 'i'rp Val Gly Thr Cys Ser Asp Tyr Pro Lys Glv 

210 215 220 

Asp Ala Ser Thr Gly Trp Asn Ser Val Ser Arg Tie He He Glu Glu 

225 230 235 2 40 

Leu Pro Lys 



<210> 203 
<211> 75 
<212> PRT 

<213> Homo sapiens 



<400> 203 

Met Ala Gly Gin Glu Asp Pro Val Gin Arg Glu He His Gin Asp Trp 
1 5 io 



15 



Ala Asn Arg Glu Tyr lie Glu He He Thr Ser Ser He Lys Lys He 
20 25 30 

Ala Asp Phe Leu Asn Ser Phe Asp Met Ser. Cys Arg Ser Arg Leu Ala 
35 40 45 

Thr Leu Asn Glu Lys Leu Thr Ala Leu Glu Arc Arg He Glu Tyr He 
5 ° 55 " 60 

Glu Ala Arg Val Thr Lys Gly Glu Thr Leu Thr 
65 70 75 

<210> 204 
<211> 248 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (185) 

<223> Xaa equals any o£ the naturally occurring L-amino acids 
<400> 204 

Met Thr Ser Gin Pro Val Pro Asn Glu Thr He He Val Leu Pro Ser 
1 5 10 is 

Asn Val He Asn Phe Ser Gin Ala Glu Lys Pro Glu Pro Thr Asn Gin 
20 25 30 

Gly Gin Asp Ser Leu Lys Lys His Leu His Ala Glu He Lys Val He 
35 40 45 

Gly Thr lie Gin He Leu Cys Gly Met Met Val Leu Ser Leu G'y He 
50 55 60 

He Leu Ala Ser Ala Ser Phe Ser Pro Asn Phe Thr Gin Val Thr Ser 
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65 70 ' 75 so 

•?hr Leu Leu Asn Ser Ala Tyr Pro Phe lie Gly Pro Phe Phe Phe lie 
85 9c 95 

lie Ser Gly Ser Leu Ser lie Ala Thr m» Lys Arg r.eu Thr Lys Leu 
100 105 110 

Leu val Kis Ser Ser Leu Val Gly Ser He Leu Ser Ala Leu Ser Ala 
115 120 125 

Leu Val Gly Phe lie He Leu Sor Val Lys Gin Ala Thr Leu Asn *ro 
130 135 140 

Ala Ser Leu Gin Cys Glu Leu Asp Lys Asn Asn He Pro Thr Arg Ser 
145 150 155 iso 

Tyr Val Ser Tyr Phe Tyr His Asp Ser Leu Tyr Thr Thr Asp Cys Tyr 
165 170 175 

Thr Ala Lys Ala Ser Leu Ala Gly Xaa Leu Ser Leu Met Leu He Cys 
1B0 185 190 

Thr Leu Leu Glu Phe Cys Leu Ala Val Leu Thr Ala Val Leu Ara Tm 
195 200 2 05 

Lys Gin Ala Tyr Scr Asp Phe Pro Gly Ser Val Leu Phe Leu Pro His 
210 215 220 

Ser Tyr He Gly Asn Ser Gly Met Ser Ser Lys Met Thr His Asp Cys 
225 230 235 2 io 

Gly Tyr Glu Glu Leu Leu Thr Ser 
245 



<210> 205 
<211> 1GB 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (83) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 205 

Met pro Leu Leu Arg Gly Leu Leu Trp Leu Gin Val Leu Cys Ala Gly 
1 5 10 is 

Pro Leu His Thr Glu Ala Val Val Leu Leu Val Pro Ser Asp Asp Gly 
20 25 30 

Arg Ala Phe Leu Leu Arg Ser Arg Leu Leu His Pro Glu Ala His Val 
35 40 45 

Pro Pro Ala Ala Asp Arg Gly Ala Ser Leu Gin Cys Val Leu His Gin 
50 55 60 
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Ala Ala Pro Lys Ser Arg Pro Arg Ser Pro Ala Ala Gly Ala Ala Leu 
65 70 75 ■ 80 

Leu His Xaa Pro Arg Arg Thr Gly Asp Glu Pro Cys Arg Glu Phe His 
85 90 95 

Gly Asn Gly Phe Pro Gly Pro Thr Gin Leu Thr Pro Gly Glu Cys Gly 
100 105 no 

Leu Pro Ala Pro Ser Ser Leu Leu Gin His Ala Ser Ala Pro Val Arg 
115 120 125 

Thr Gly Ser Glu Gly Gin Val Val Gly Cys Pro Arg Ala Arg Gly Glu 
130 135 140 

Thr Gly Glu Gly Leu Ser Leu Ala Phe Leu Ser Ser Leu Met Phe Thr 
145 150 r 155 160 

Ser Arg Asn Gly Leu Val Gly Cys 
165 



<210> 206 

<211> 21B 

<212> PRT 

<213> Homo sapiens 

<400> 206 

Met Gly Ser Ala Ala Leu Glu lie Leu Gly Leu Val Leu Cys Leu Val 
1 5 io 1S 

Gly Trp Gly Gly Leu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Val 
?0 25 30 

Thr Ala Phe Leu Asp His Asn He Val Thr Ala Gin Thr Thr Trp Lys 
35 40 45 

Gly Leu Trp Met Ser Cys Val Val Gin Ser Thr Gly His Met Gin Cvs 
50 55 60 

Lys Val Tyr Asp Ser Val Leu Ala Leu Ser Thr Glu Val Gin Ala Ala 
65 70 75 80 

Arg Ala Leu Thr Val Ser Ala Val l,eu Leu Ala Phe Val Ala Leu Phe 
85 90 95 

Val Thr Leu Ala Gly Ala Gin Cys Thr Thr Cys Val Ala Pro Gly Pro 
100 105 no 

Ala Lys Ala Arg Val Ala Lgu Thr Gly Gly Val Leu Tyr Leu Phe Cvs 
115 120 125 

Gly Leu Leu Ala Leu Val Pro Leu Cys Trp Phe Ala Asn He Val Val 
130 135 140 

Arg Glu Phe Tyr Asp Pro Ser Val Pro Val Ser Gin Lys Tyr Glu Leu 
145 150 155 160 

Gly Ala Ala Leu Tyr Tie Gly Trp Ala Ala Thr Ala Leu Leu Met Val 
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165 170 175 

Gly Gly Cys Leu Leu Cys Cys Gly Ala Trp Val Cys Thr Gly Arg Pro 
180 185 190 

Asp Leu ser Phe Pro Val Lys Tyr Ser Ala Pro Arg Arg Pro Thr Ala 
195 200 205 

Thr Gly Asp Tyr Asp Lya Lys Asn Tyr Val 
210 215 



<210> 207 
<211> 73 
<212> PRT 

<213> Homo sapiens 

<220> r . 

<221> SITE 
<222> (73) 

<223> Xaa equals stop translation 
<400> 207 

Het Thr Ser Tyr He Leu lie Ser Phe Val Leu Leu lie Gly Val Gly 
1 5 io is 

Cys lie Glu Lys Asp Gin Ser Cys Pro Val Phe Gly Gly Arg Lys Arg 
20 2S 30 

Leu His Leu Leu Phe Val Gly Gly Gin Leu Arg Gin Val Arg Met Leu 
35 40 45 

Arg Gly Glu Leu Ser Cys Ala Cys Tyr Arg Pro His Val Cln Ala Leu 
50 55 60 

Gin Leu Gly Gly Cys Thr Cys Phe Xaa 
65 70 



<210> 208 
<211> 348 
<212> PRT 

<213> Homo sapiens 
<400> 208 

Met Leu Cys Pro Trp Arg Thr Ala Asn Leu Gly Leu Leu Leu He Leu 
1 5 io i 5 

Thr He Phe Leu Val Ala Glu Ala Glu Gly Ala Ala Gin Pro Asn Asn 
20 25 30 

Ser Leu Met Leu Gin Thr Ser Lys Glu Asn Kis Ala Leu Ala Ser Ser 
35 40 45 

Ser Leu Cys Met Asp Glu Lys Gin He Thr Gin Asn Tyr Ser Lys Val 
50 55 60 

Leu Ala Glu Val Asn Thr Ser Trp Pro Val Lys Met Ala Thr Asr Ala 
6o 70 75 an 
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Val Leu Cys Cys Pro Pro lie Ala Leu Arg Asn Leu He He He Thr 
85 90 95 

Trp Glu He He Leu Arg Gly Gin Pro Ser Cys Thr Lys Ala Tyr Lys 
100 105 HO 

Lys Glu Thr Asn Glu Thr Lys Glu Thr Asn Cys Thr Asp Glu Arg He 
115 120 125 

Thr Trp Val Ser Arg Pro Asp Gin Asn Ser Asp Leu Gin He Arg Thr 
130 135 140 

Val Ala He Thr His Asp Gly Tyr Tyr Arg Cys He Met Val Thr Pro 
i45 ISO 155 160 

Asp Gly Asn Phe His Argr, Gly Tyr His Leu Gin Val Leu Val Thr Pro 
165 ' 170 175 

Glu Val Thr Leu Phe Gin Asn Arg Asn Arg Thr Ala Val Cys Lvs Ala 
180 185 190 

Val Ala Gly Lys Pro Ala Ala His He Ser Trp He Pro Glu Gly Asp 
195 200 205 

Cys Ala Thr Lys Gin Glu Tyr Trp Ser Asn Cly'Thr Val Thr Val Lys 
210 215 220 

Ser Thr Cys His Trp Glu Val His Asn Val Ser Thr Val Asn Cys His 
225 230 23S 240 

Val Ser His Leu Thr Gly Asn Lys Ser Leu Tyr He Glu Leu Leu Pro 
245 250 255 

Val Pro Gly Ala Lys Lys Ser Ala Lys Leu Tyr He Pro Tyr He He 
260 255 270 

Leu Thr He He He Lau Thr He Val Gly Phe He Trp Leu Leu Lys 
2? 5 280 285 

Val Asn Gly Cys Arg Lys Tyr Lys Leu Asn Lys Thr Glu Ser Thr Pro 
290 295 300 

Val Val Glu Glu Asp Glu Met Gin Pro Tyr Ala Ser Tyr Thr Glu Lys 
305 310 315 320 

Asn Asn Pro Leu Tyr Asp Thr Thr Asn Lys Val Lys Ala Ser Glu Ala 
325 330 335 

Leu Gin Ser Glu Val Asp Thr Asp Leu His Thr Leu 
340 345 



<210> 209 

<211> 73 

<212> PRT 

<213> Homo sapiens 



<220> 
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150 

<221> SITE 
<222> (73) 

<223> Xaa equals stop trans lac ion 
<400> 209 

Met Ala Arg Gly Cys Val Cys Ser Leu Cys Ala Ser Val Cys lie Phe 
1 5 .10 is 

Leu Ser Ser Leu Phe Pro Leu Leu Pro Ser Val His Ser Val Asn He 
20 25 30 

He Ser Cys Leu Leu Leu Ser Lys Cys Phe Glu Gly Leu Glu Leu Met 
35 40 45 

Cys Glu His Leu Tyr Gin Leu Ser Gin Leu His Val Leu His His He 
50 55 60 

Phe Ser Tyr Leu Leu Cys Thr Pro Xaa 
S5 70 



<210> 210 
<211> 608 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (265) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (597) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 210 

Met Val Gly Thr Lys Leu Arg Gin Thr Lys Asp Ala Leu Phe Th~ He 
1 5 10 is 

Leu His Asp Leu Axg Pro Gin Asp Arg Phe Ser He He Gly Phe Ser 
20 25 30 

Asn Arg He Ly 3 Val Trp Lys Asp His Leu He Ser Val Thr Pro Asp 
35 40 45 

Sex He Arg Asp Gly Lys Val Tyr lie His His Met Ser Pro Thr Gly 
50 55 60 

Gly Thr Asp He Asn Gly Val Leu Gin Arg Ala He Arg Leu Leu Asn 
65 70 75 eo 

Lys Tyr Val Ala His Ser Gly He Gly Asp Arg Ser Val Ser Leu He 
85 90 95 

Val Phe Leu Thr Asp Gly Lys Pro Thr Val Gly Glu Thr His Thr Leu 
100 105 no 

Lys He Leu Asn Asn Thr Arg Glu Ala Ala Arg GLy Gin Val Cys He 
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115 120 125 

Phe Thr He Gly He Gly Asn Asp Val Asp Phe Arg Leu Leu Glu Lys 
130 135 140 

Leu Ser Leu Glu Asn Cys Gly Leu Thr Arg Arg Val His Glu Glu Glu 
145 150 155 ifio 

Asp Ala Gly Ser Gin Leu Ho Gly Phe Tyr Asp Glu He Arg Thr Pro 
165 170 175 

Leu Leu Ser Asp He Arg He Asp Tyr Pro Pro Ser Ser Val Val Gin 
180 195 190 

Ala Thr Lys Thr Leu Phe Pro Asn Tyr Phe Asn Gly Ser Glu He H e 
19 5 200 205 

He Ala Gly Lys Leu Val Asp Arg Lys Leu Asp His Leu His- Val Glu 
210 215 220 

Val Thr Ala Ser Asn Ser Lys Lys Phe lie lie Leu Lys Thr Asp Val 
225 230 235 240 

Pro Val Arg Pro Gin Lys Ala Gly Lys Asp Val Thr Gly Ser Pro Arg 
245 250 255 

Pro Gly Gly Asp Gly Glu Gly Asp Xaa Asn His He Glu Arg Leu T-p 
260 265 270 

Ser Tyr Leu Thr Thr Lys Glu Leu Leu Ser Ser Trp Leu Gin Ser Asp 
275 280 285 

Asp Glu Pro Glu Lys Glu Arg Leu Arg Gin Arg Ala Gin Ala Leu Ala 
29° 295 300 

val ser Tyr Arg Phe Leu Thr Pro Phe Thr Ser Met Lys Leu Arg Gly 
305 310 315 320 

Pro Val Pro Arg Met Asp Gly Leu Glu Glu Ala His Gly Met Ser Ala 
325 330 335 

Ala Met Gly Pro Glu Pro Val Val Gin Ser Val Arg Gly Ala Gly Thr 
340 345 350 

Gin Pro Gly Pro Leu Leu Lys Lys Pro Tyr Gin Pro Arg He Lys He 
355 360 365 

Ser Lys Thr Ser Val Asp Gly Asp Pro Hie Phe Val Val Asp Phe Pro 
370 375 380 

Leu Ser Arg Leu Thr Val Cys Phe Asn He Asp Gly Gin Pro Gly Asp 
385 390 395 /.00 

He Leu Arg Leu Val Ser Asp His Arg Asp Ser Gly Val Thr Val Asn 
4C5 410 415 

Gly Glu Leu He Gly Ala Pro Ala Pro Pro Asn Gly His Lys Lys Gin 
420 425 430 
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Arg Thr Tyr Leu Arg Thr lie Thr lie Leu lie Asn Lys Pro Glu Arg 
435 44C 445 

Ser Tyr Leu Glu lie Thr Pro Ser Arg Val lie Leu Asp Gly Gly Asp 
450 455 460 

Arg Leu Val Leu Pro Cys Asn Gin Ser Val Val Val Gly Ser Trp Gly 
465 470 475 480 

Leu Glu Val Ser Val Ser Ala Asn Ala Asn Val Thr Val Thr He Gin 
485 490 495 

Gly Ser He Ala Phe Val lie Leu He His Leu Tyr Lys Lys Pro Ala 
500 505 510 

Pro Phe Gin Arg His His Leu Gly Phe Tyr He Ala Asn Ser Glu Gly 
515 520 525 

Leu Ser Ser Asn Cys His Gly Leu Leu Gly Gin Phe Leu Asn Gin Asp 
530 535 

Ala Arg Leu Thr Glu Asp Pro Ala Gly Pro Ser Gin Asn Leu Thr His 
545 550 555 560 

Pro Leu Leu Leu Gin Val Gly Glu Gly Pro Glu Ala Val Leu Thr Val 
565 570 575 

Lys Gly His Gin Val Pro Val Val Trp Lys Gin Arg Lys He Tyr Asn 
580 585 590 

Gly Glu Glu Gin Xaa Asp Cys Trp Phe Ala Arg Asn Met Pro Pro Asn 
595 600 605 



<210> 211 

<211> 252 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (252) 

<223> Xaa equals stop translation 

<400> 211 

Met Ala Pro Ala Ser Arg Leu Leu Ala Leu Trp Ala Leu Ala Ala Val 
1 5 10 15 

Ala Leu Pro Gly Ser Gly Ala Glu Gly Asp Gly Gly Trp Arg Pro Gly 
20 25 30 

Gly Pro Gly Ala Val Ala Glu Glu Glu Arg Cys Thr Val Glu Arg Arg 
35 40 45 



Ala Asp Leu Thr Tyr Ala Glu Phe Val Gin Gin Tyr Ala Phe Val Arg 
50 55 60 
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Pro Val He Leu Gin Gly Leu Thr Asp Asn Ser Arg Phe Arg Ala lieu 
65 7 ° 75 so 

Cys Ser Arg Asp Arg Leu Leu Ala Ser Phe Gly Asp Arg Val Val Arg 
85 90 95 

Leu Ser Thr Ala Asn Thr Tyr Ser Tyr His Lys Val Asp Leu Pro Phe 
100 105 no 

Gin Glu Tyr Val Glu Gin Leu Leu His Pro Gin Asp Pro Thr Ser Leu 
115 120 125 

Gly Asn Asp Thr Leu Tyr Phe Phe Gly Asp Asn Asn Phe Thr Glu Trp 
"0 135 UQ 

Ala Ser Leu Phe Arg His.. Tyr Ser Pro Pro Pro Phe Gly Leu Leu Gly 
145 «0* 155 160 

Thr Ala Pro Ala Tyr Ser Phe Gly He Ala Gly Ala Gly Ser Gly Val 
165 170 175 

Pro Phe His Trp Kis Gly Pro Gly Tyr Ser Glu Val He Tyr Gly Arg 
180 185 190 

Lys Arg Trp Phe Leu Tyr Pro Pro Glu Lys Thr Pro Glu Phe His Pro 
195 200 205 

Asn Lys Thr Thr Leu Ala Trp Leu Arg Asp Thr Tyr Pro Ala cys Thr 
210 215 220 

Val Cys Thr Ala Leu Glu Cys Thr He Arg Ala Gly Glu Val Leu Thr 
225 230 235 240 

Ser Arg Pro Leu Val Ala cys Tyr Ala Gin Pro Xaa 
245 250 



<210> 212 
<211> 226 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (226) 

<223> Xaa equals stop translation 
<400> 212 

Met Lys Glu He Pro Ala Leu Leu His Leu Pro Val Leu He He Met 
1 5 10 15 

Ala Leu Ala He Leu Ser Phe Cys Tyr Gly Ala Gly Lys Ser Val His 
20 25 30 

Val Leu Arg His He Gly Gly Pro Glu Arg Glu Pro Pro Glr Ala Leu 
35 4 o 45 



Arg Pro Arg Asp Aru Arg Arg Gin Glu Glu He 



Asp Tyr Arg Pro Asp 
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50 

Gly Gly Ala Gly Asp 
65 

Thr Glu Gin Gly Pro 
85 

Leu Arg Glu Arg Asp 
100 

Pro Glu Val Leu Arg 
115 

His Pro Tlu Val Val 
130 

Pro Lys Glu Thr Gly 
145 

Ser Thr Glu Ser Ser 
165 

Ser Gly Asn Thr Glu 
180 

Ser Glu Ala Ala Gly 
195 

Arg Gly Val Ala Gly 
210 

Gly Xaa 
22 o 



55 

Ala Asp Phe His Tyr Arg 
70 75 

Tyr Ala Lys Thr Tyr Glu 
90 

Val Asp Leu Arg Phe Gin 
105 

Ala Phe Asp Val Pro Asp 
120 

Pro Ser His Lys Ser Pro 
135 

Gly lie Leu Gly Glu Gly 
ISO 155 

Gin Ser Ala Lys Pro Val 
170 

Gly Ser Pro Ala Ala Glu 
185 

Ser Pro Asp Gin Gly Ser 
200 

Pro Arg Gly Gin Asp Pro 
215 



60 



Gly Gin Met Gly Pro 
80 

Gly Arg Arg Glu lie 
95 

Thr Gly Asn Lys Ser 
110 

Ala Glu Ala Arg Glu 
125 

Val Leu Asp Thr Lys 
140 

Thr Pro Lys Glu Ser 
160 

Ser Gly Gin Asp Thr 
175 

Lys Ala Gin Leu Lys 
190 

Thr Tyr Ser Pro Ala 
205 

Val Ser Ser Pro Cys 
220 



<210> 213 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> <51) 

<223> Xaa equals stop translation 
<400> 213 

Met Met Gly Leu Leu Glu Thr Gly Asn Val wcu Phe Trp Val Trp Val 
1 5 10 15 

Val Val Thr Cys Val Tyr Ser Leu Tyr Ala Asn Ser Leu Asn Cys Thr 
20 25 30 

Asp Met Asp Cys Ala Pro Phe Tyr Ket Cys Val Met Leu Gin Gin Lys 
35 40 45 

Cys Gin Xaa 
50 
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<210> 214 
<211> 172 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (172) 

<223> Xaa equals stop translation 
<400> 214 

Met Trp Leu Trp Ala Val Ser Pro Val Arg Pro Arg Thr Cys Leu Pro 
1 5 .10 15 

Pro Cys Pro Arg Leu Trp Leu Trp lie Ser Met Thr Leu Val Pro Ser 
20 . 25 3 o 

Ser Ser Ala Trp Lya Ser His Gly Ala Pro Ser Thr Arg Met Thr Ser 
35 40 45 

Pro Gin Leu Leu Leu Leu Ser Thr Arg Pro Pro Gin Ser Pro Ser Ala 
50 55 60 

Ser Pro Pro He Ala Arg Ala His Arg Thr His Pro His Phe Gly Asn 
65 ™ 75 80 

Arg Leu Ser Tie Thr Cys Cys Asp Gly Arg Arg Ser Trp Arg Met Gly 
8S 90 95 

Gin His Gly Pro Cys His Leu Asn Leu Gin Thr Thr His Pro Ala His 
iOO 105 no 

Ser Ser Gin Ala Leu Pro Ala Thr His Gin Pro Leu Gly Pro Trp Cys 
115 120 125 

Ser Ser Pro Ser Pro Phe Pro Ser Lys Leu Pro Ser Ala Gly Leu Arg 
130 13S 140 

Pro Pro Ala Leu Giy Pro Trp Met Arg Arg Gly Pro Trp Pro Gin Ser 
145 150 155 160 

Trp Gin Met Gly Met His Pro Thr Val Gly Leu Xaa 
165 170 



<210> 215 

<211> 4fl 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 215 

Met Trp Leu Leu He He Phe Cys Lys Ser Ala Ser Ala Ser Val Leu 
1 S 10 1S 
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Cys Trp lie Lys Lys Phe His Pro Val Phe Gin Glu Ser Leu Leu Tyr 
20 25 . 30 



Leu Val Gin Glu Gly Ser Leu Cys Tyr Val Gin Gin Lys Val Pro Xaa 
35 40 45 



<210> 216 
<211> 139 
<212> PRT 
<213> Homo sapiens 

<400> 216 

Met Glu Ala Val Val Phe Val Phe Ser Leu Leu Asp Cys Cys Ala Leu 
1 5 10 15 

lie Phe Leu Ser Val Tyr Phe lie lie Thr Leu Ser Asp Leu Glu Cys 
20 25 30 

Asp Tyr He Asn Ala Arg Ser Cys Cys Ser Lys Leu Asn Lys Trp Val 
35 40 45 

lie Pro Glu Leu lie Gly His Thr He Val Thr Val Leu Leu Leu Met 
50 55 60 

Ser Leu His Trp Phe He Phe Leu Leu Asn Leu Pro Val Ala Thr Trp 
65 70 75 80 

Asn He Tyr Arg Tyr He Met Val Pro Ser Gly Asn Met Gly Val Phe 
85 90 95 

Asp Pro Thr Glu He His Asn Arg Gly Gin Leu Lys Ser His Met Lys 
100 105 110 

Glu Ala Met He Ly3 Leu Gly Phe His Leu Leu Cys Phe Phe Met Tyr 

120 125 

Leu Tyr Ser Met He Leu Ala Leu He Asn Asp 
130 135 



<210> 217 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 217 

Met Ser Gly Ser Ser Leu Pro Ser Ala Leu Ala Leu Ser Leu Leu Leu 
1 5 io is 
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Val Ser Gly Ser Leu Leu Fro Gly Pro Gly Ala Ala Gin Asn Val Ara 
20 25 30 - 

Val Gin Ser Gly Gin Asp Gin Lys Xaa 
35 40 



<210> 218 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (52) 

<223> xaa equals stop translation 
<400> 218 ? 

Met Pro Ser His lie Arg Ala His Leu Phe Leu Leu Leu Phe Phe Leu 
1 5 10 15 

Phe He Tyr Gin Gly lie Ser Ser He Ser Gin Ala Ser Gly Leu Thr 
20 25 30 

Leu Lys Thr Gin Asn Glu Lys Asp lie Gin Val Ser He Leu Lys Glu 
35 40 



45 



Phe Val Val Xaa 
50 



<210> 219 
<211> 49 
<212> PRT 

< 213> Homo sapiens 
<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 
<400> 219 

Met Cys He Tyr Gin Ser Glu Gin Met Leu Ala Lou Leu Leu Val Leu 
1 5 10 15 

Val Phe Cys He Ser Leu Leu Val Leu Val Cys Trp Gly Ser His Asn 
20 25 30 

Lys Val Tro Gin Lys Phe He Phe Ser Gin Phe Trp Gly Leu Glu Asp 
35 40 45 

Xaa 



<210> 220 
<2H> 42 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals scop translation 
<400> 220 

Met Ala Val Pro Leu Phe Leu Tyr lie Phe Thr Leu Leu Pro Leu Leu 
15 10 15 

?ro Phe Leu Leu Ser Leu Cys Phe Ser Pro Leu Thr Val Lys Arg Ser 
20 25 30 

Ser Ser Ser Glu Ser Lys Ser Ser Leu Xaa 
35 40 

<210> 221 
<211> 41 
' <212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> xaa equals stop translation 
<400> 221 

Met Gly Met Leu Leu Ala Phe Trp Leu Pro Gly Ala Ser Trp Gin Glu 



10 15 



Ala Gly Pro Arg Ala Ser Thr Gin Arg Met Arg Thr Gin Thr Gin Met: 
20 25 30 

Ser Thr Arg Lys Pro Lys Pro Ala Xaa 
35 40 

<210> 222 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 222 

Met Glu Pro Ala Met Val Leu Lys Phe Leu Ser Ser Leu Pro Glu Asn 



10 15 



Leu Phe Leu Pro Ser Leu . Lou Phe Phe Ala Trp Leu Cys Trp Asn Met 
20 25 30 

Val Cys Gly Ser Pro Val Ser Cys Pro Tyr Xaa 
35 <0 
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<210> 223 
<211> 204 
<212> PRT 

<213> Komo sapiens 
<220> 

<221> SITE 
<222> (204) 

<223> Xaa equals stop translation 
^400> 223 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly Cys 
1 5 10 is 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
35 40 45 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp »he Ala 
50 55 60 

Leu Gly Ala Ala Leu Gin Leu Ala He Ser Thr Tyr Ala Ala Tyr He 
65 7 ° 75 so 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 
85 go 95 

Arg Thr Val Ala He He Gly Gly Phe Leu Val Leu Ala Ser Gly Ala 
100 105 no 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
115 120 125 

Gly Gin Val Phe Leu Gly He Tyr Leu He Cys Val Ala Tyr Ser Leu 
"0 135 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Glv 
14S ISO 155 160 

Gly Glu Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 
165 170 175 

Pro Gly Leu Ser Val Arg Leu Leu Arg Asp Pro Arg Cys Pro Asp Pro 
180 185 190 

Gly Cys Thr Ala Ala Pro Cys His Ala Ala His Xaa 
195 200 



<210> 224 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> {43 ) 
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<223> Xaa equals stop translation 
<400> 224 

Met Arg Val Arg lie Gly Leu Thr Leu Leu . Leu cys Ala Val Leu Leu 
IS io is 

Ser Leu Ala Ser Ala Ser Ser Asp Glu Gl u Gly Ser Gin Asp Glu Ser 
20 2S 30 

Leu Gly phe Gin Asp Tyr Plie Asp lie Arg Xaa 
35 40 



<210> 225 
<211> 156 
<212> PRT 
<213> Homo sapiens 

■» 

<22Q> 
<221> SITE 
<222> (156) 

<223> Xaa equals stop translation 
<400> 225 

Met Ala Arg Gly Ser Leu Arg Arg Leu Leu Arg Leu Leu Val Leu Gly 
1 5 io 15 

Leu Trp Leu Ala Leu Leu Arg Ser Val Ala Gly Glu Gin Ala Pro Gly 
20 25 30 

Thr Ala Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp Leu Asp Lys 
35 40 45 

Cys Met Asp Cys Ser Thr Ser Cys Pro Leu Pro Ala Ala Leu A* a His 
50 55 60 

Pro Trp Gly Arg Ser Glu Pro Asp Leu Arg Ala Gly Ala Ala Phe Trp 
65 70 75 80 

Leu Phe Gly Leu Glu Thr Met Pro Gin Clu Arg Glu Val His His Pro 
85 90 95 

His Arg Gly Asp Arg Arg Arg Gly Leu Pro Ser Cys Gly Ala Asp Pro 
100 105 HQ 

val Thr Ket Cys Pro Leu Pro Ala Gly Ala Arg Pro Leu lie II© His 
115 120 125 

Ser Ser He Leu Glu Pro Val Ser Ala Ser Gin Thr Arg Arg Glu Pro 
130 135 140 

Ser Ser Ser Asn His Lys Gly Gly Gjy Gly Ara Xaa 
145 iso 15 5 



<210> 226 

<211> 74 

<212> PRT 

<213> Homo sapiens 
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<220> 
<221> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-anino acids 

<220> 
<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 
<221> SITE 
<222> (54) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE > 
<222> (551 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (68) 

<223> xaa equals any of the naturally occurring L-araino acids 

<220> 
<221> SITE 
<222> (74) 

<223> Xaa equals stop translation 
<400> 226 

Met Phe Tyr Lys Leu Thr Leu He Leu Cys Glu Leu Ser Val Ala Gly 
1 5 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro Arg His 
20 25 30 

He Cys ser Gin Arg Xaa Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40 45 

Leu Gin Thr Thr Trp Xaa Xaa Pro Asp Lys Pro lie Pro Arg Leu Ser 
50 55 eo 

Pro Pro Leu Xaa Ser Asp Pro Lys Arg Xaa 
65 70 



<210> 227 
<211> 167 
<212> PRT 
<213> Homo sapiens 

<220> 
<221> SITE 
<222> (167) 

<223> Xaa equals stop translation 
<400> 227 
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Met Gly Ser Arg Phe Leu Leu Val Leu Leu Ser Gly Leu Thr Val Leu 
1 5 . 10 15 • 

Leu Ala Leu Pro Gly Ser Glu Ala Lys Asn Sex Gly Ala Ser Cvs Pro 
20 25 30 " 

Pro Cys Pro Lys Tyr Ala Ser Cys His Asr. Ser Thr His Cys Thr Cys 
35 40 45 

Glu Asp Gly Phe Arg Ala Arg Ser Gly Arg Thr Tyr Phe His Asp Ser 
50 55 so 

Ser Glu Lys Cys Glu Asp He Asn Glu Cys Glu Thr Gly Leu Ala Lys 



80 



Cys Lys Tyr Lys Ala Tyr Cys Arg Asn Lys Val Gly Gly Tyr He Cys 
85 90 g5 

Ser Cys Leu Val Lys Tyr Thr Leu Phe Asn Phe Leu Ala Gly He He 
!00 105 no 

Asp Tyr Asp His Pro Asp Cys Tyr Glu Asn Asn Ser Gin Gly Thr Thr 
115 120 125 

Gin Ser Asn Val Asp He Trp Val Ser Gly Val Lys Pro Gly Phe Gly 
130 135 i 4 o 

Lys Gin Leu Val Arg He Thr Met Pro Phe Ser Tyr Pro Asn He Asn 
145 • 150 155 160 

Met Ser Ser Cys Asp Phe Xaa 
165 



<210> 228 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (71) 

<223> Xaa equals stop translation 
<400> 228 

Met Lys Pro Lys His Leu Glu Trp Cys Leu Ala His Ser Trp Cys Val 
1 5 10 is 

He Trp Leu Ser Phe Val Ser Pro Pro Thr ser His Leu Glu Cys Asp 
20 25 30 

Gly Phe Pro Gly Ser Leu Leu Pro Pro Cys Glu Glu Gly Arg Cys Phe 
35 40 45 

Pro Phe Thr Phe His His His Asp Cys His Gly Cys Ser Pro Leu Gin 
50 55 so 

Ser Ser pro Gly Gin His Xaa 
G 5 70 
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<210> 229 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (273) 

<223> Xaa equals scop translation 
<40O> 229 

Met Cys Cys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 
1 5 10 15 

Ala Ala Gly Gly Ser Gly Apj Thr Tyr Pro His Arg Thr Leu Leu Asp 
20 ' 25 30 

ser Glu Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin He 
35 40 45 

Ala Phe Arg Leu Gin Val Arg Thr Ala Gly Tyr Val Gly Phe Gly Phe 
50 55 60 

Ser Pro Thr Gly Ala Met Ala Ser Ala Asp He Val Val Gly Gly Val 
65 70 75 80 

Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arg 
85 . 90 95 

Glu Leu Lys Lys Asp Ala Gin Gin Asp Tyr His Leu Glu Tyr Ala Met 
100 105 110 

Glu Asn Ser Thr His Thr He He Glu Phe Thr Arg Glu Leu His Thr 
115 120 125 

Cys Asp He Asn Asp Lys Ser He Thr Asp Ser Thr Val Arg Val He 
130 135 140 

Trp Ala Tyr His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His 
145 150 155 ieo 

Asp ser Asn Arg Gly Thr Lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
1G5 170 . 175 

Thr Ser Val Leu Ser Thr Ala Leu Pro Tyr Phe Asp Leu Val Asr Gin 
180 185 lgo 

Asp val Pro He Pro Asn Lys Asp Thr Thr Tyr Trp Cys Gin Met *>he 
l 95 200 205 

Lys He Pro Val Phe Gin Glu Lys His His Val He Lys Val Glu Pro 
210 215 220 

Val He Gin Arg Gly .His Glu Ser Leu Val His His Tie Leu Leu -yr 
225 230 235 . 240 

Gin Cys Ser Asn Asn Phe Asn Asp Ser Val Fro Gly He Arg Ala Arg 
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245 250 255 

lie Ala lie Thr Pro Thr Cys Pro Met His Ser Ser Pro Val Lys Leu 
260 265 270 

Xaa 



<210> 230 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<22l> SITE 
<222> (82) 

<223> Xaa equals stop translation 
<400> 230 

Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu 
1 5 io is 

Val Gly Leu Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Glu Pro Val 
20 25 30 

Cys Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val He Tyr Phe His 
35 40 45 

Asp Thr Ser Arg Arg Leu Asn Phe Glu Glu Ala Lys Glu Ala Cys Arg 
50 55 60 

Arg Gly Trp Arg Pro Ala Ser Gin His Arg Val Leu Lys Met Asn Arg 
65 70 75 80 

Asn Xaa 



<210> 231 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 231 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 5 10 is 

Gly Ala Ala Val He Leu Ala Gin Leu Leu Arg Arg Cys Arg Gly Gly 
20 25 30 

Phe Leu Glu Pro Val Xaa Pro Pro Arg Pro Asp Arg Val Ala He Val 
35 40 45 
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Thr Gly Gly Thr Asp Gly lie Gly Tyr Ser Thr Ala Asn rl« Trp Arg 
50 55 60 

Asp Leu Ala Cys Met Leu Ser 
65 70 



<210> 232 
<211> 225 
<212> PRT 

<213> Homo sapiens 
<220> 

<22I> SITE 
<222> (5J 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 232 

His Glu Arg Ala xaa Gly Pro Ser Arg Gly His Gly Glu Leu Leu Ser 
1 5 io 3S 

Cys Val Leu Gly Pro Arg Leu Tyr Lys He Tyr Arg Glu Arg Asp Ser 
20 25 30 

Glu Arg Ala Pro Ala Ser Val Pro Glu Thr Pro Thr Ala Val Thr Ala 
35 40 45 

Pro His Ser Ser Ser Trp Asp Thr Tyr Tyr Gin Pro Arg Ala Leu Glu 
50 55 60 

Lys His Ala Asp Ser He Leu Ala Leu Ala Ser Val ?he Trp Ser He 
65 70 75 80 

Ser Tyr Tyr Ser Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg Lys Gly 
85 go 95 

Tyr Leu Ser Leu Ser Lys Val Val Pro Phe Ser His Tyr Ala Gly Thr 
100 105 no 

Leu Leu Leu Leu Leu Ala Gly Val Ala Cys Ser Glu Ala Leu Ala Ala 
115 120 125 

Gly Pro Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His 
130 135 140 

lie Gly Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr 

145 15 o 155 160 

Leu Thr ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala 
165 170 175 

Ala Gly. Arg Ser Leu Glu Gly Ala Leu Pro Ala Gly Val Trp Pro Cvs 
180 las 190 

Phe Ala Gin Ser Pro Cys Thr Gly Gly Gin Gin Thr Pro Ser Ser Thr 
195 200 205 

Gly Leu Arg Ser Cys Leu Val Arg Ser Pro Ala Thr Trp Trp Arg Thr 
210 215 220 
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Pro 

225 



<210> 233 
<2L1> 314 
<2L2> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (147) 

<223> xaa equals any of the naturally occurring L-amino acids 

* <220> 

<221> SITE f 
<222> (211) 

<223> xaa equals any of the naturally occurring L-amino acids 
<400> 233 

Met Leu Pro Ala Arg Leu Pro Phe Arg Leu Leu Ser Leu Phe Leu Arg 
1 5 10 15 

Gly Ser Ala Pro Thr Ala Ala Arg His Gly Leu Arg Glu Pro Leu Leu 
20 25 30 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu 
35 40 45 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Glu Gly Phe Gly Glu 
50 55 60 

Gly Gly Asp Met Gin Giu Arg Phe Leu Phe Pro Glu Tyr lie Leu Asp 
65 70 75 80 

Pro Glu Pro Gin Pro Thr Arg Glu Lys Gin Leu Gin Glu Leu Gin Gin 
85 90 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Gin Arg Arg Glu Glu Arg 
100 105 no 

Arg Gin Gin Asn Leu Arg Ala Arg Ser Arg Glu His Pro Val Val Gly 
115 120 125 

His Pro Asp Pro Ala Leu Pro Pro Ser Glv Val Asn Cys Ser Gly Cys 
130 135 140 

Gly Ala Xaa Leu His Cys Gin Asp Ala Gly Val Pro Gly Tyr Leu Pro 
145 150 155 160 

Arg Glu Lys Phe Leu Arg Thr Ala Glu Ala Asp Gly Gly Leu Ala Arg 
165 170 175 

Thr Val Cys Gin Arg Cys Trp Leu Leu Ser His. Hi s Arg Arg Ala Leu 
180 185 igo 

Arg Leu Gin Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Ala 
195 200 205 
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Leu Arg Xaa Pro Gly Pro Ser Leu Val Leu Tyr Met Val Asp Leu Leu 
210 215 220 

Asp Leu Pro Asp Ala Leu Leu Pro Asp Leu Pro Ala Leu Val Gly Pro 
225 230 235 240 

Lys Gin Leu Ji e Val Leu Gly Acn Lys Val Asp Leu Leu Pro Gin Asp 
245 250 255 

Ala Pro Gly Tyr Arg Gin Arg Leu Arg Glu Arg Leu Tn> Glu Asp Cys 
260 265 270 

Ala Arg Ala Gly Leu Leu Leu Ala Pro Gly Thr Lys Gly Hi a Ser Ala 
275 280 285 

Pro Ser Arg Thr Ser His Arg Thr Gly Arg lie Arg He Arg Arg Thr 
290 295 300 

Gly Pro Ala Gin Trp Ser Gly Thr Cys Gly 
305 310 



<210> 234 

<211> 93 

<212> PRT 

<213> Homo sapiens 

<400> 234 

Met Arg Pro Gin Gly Pro Ala Ala Ser Pro Gin Arg Leu Arg Gly Leu 
1 5 10 IS 

Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala Ser Glu 
20 25 30 

He Pro Lys Gly Lys Gin Lys Ala His Ser Gly Arg G'.y Arg Trp Trp 
35 40 45 

Thr Cys lie Met Glu Cys Ala Tyr Lys Gly Gin Gin Glu Cys Leu Val 
50 55 60 

Glu Thr Gly Ala Leu Gly Pro Met Ala Phe Arg Val His Leu Gly Ser 
65 70 75 80 

Gin Val Gly Met Asp Ser Lys Glu Lys Arg Gly Asn Val 
85 90 



<210> 235 

<211> 73 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (73) 

<223> Xaa equals stop translation 
<40O> 235 
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Met Gly Ser Ala Ala Leu Glu He Leu Gly Leu Val Leu Cys Leu Val 

Gly Trp Gly Gly Leu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Va 1 
20 25 30 

Thr Ala Phe. Leu Asp His Asn lie Val Thr Ala Gin Thr Thr Trp Lys 
35 40 45 

Gly Leu Trp Met Ser Cys Val Val Gin Ser Thr Gly Thr Cys S^r Ala 
b0 55 60 

Lys Cys Thr Thr Arg Cys Trp Leu Xaa 
65 70 



<210> 236 

<211> 349 f 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (283) 

<223> xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (293) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (325) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (326) 

<223> xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (349) 

<223> Xaa equals stop translation 
<400> 236 

Met Leu Cys Pro Trp Arg Thr Ala Asn Leu Gly Leu Leu Leu He Leu 
1 5 10 15 

Thr He Phe Leu Val Ala Glu Ala Glu Gly Ala Ala Gin Pro Asn Asn 
20 25 30 

Ser Leu Met Leu Gin Thr Ser Lys Glu Asn His Ala Leu Ala Ser Ser 
35 40 45 

Ser Leu Cys Met Asp Glu Lys Gin lie Thr Gin Asn Tyr Ser Lys Va" 
50 55 60 
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Leu Ala Giu Val Asn Thr Ser Trp Pro Val Lys Met Ala Thr Asn Ala 

Val Leu Cys Cys Pro Pro lie Ala Leu Arg Asn Leu lie lie lie Thr 
85 90 95 

Trp Glu He He Leu Arg Gly Gin Pro Ser Cys Thr Lvs Ala Tyr Lys 
100 105 " no 

Lys Glu Thr Asn Glu Thr Lys Glu Thr Asn Cys Thr Asp Glu Arg He 
115 120 12 5 

Thr Trp Val Ser Arg Pro Asp Gin Asn Ser Asp Leu Gin He Arg Thr 
130 135 14 0. 

Val Ala He Thr His Asp Gly Tyr Tyr Arg Cys He Met: Val Thr Pro 
145 150 # 155 160 

Asp Gly Asn Phe His Arg Gly Tyr His Leu Gin Val Leu Val Thr Pro 
!65 170 175 

Glu Val Thr Leu Phe Gin Asn Arg Asn Arg Thr Ala Val Cys Lys Ala 
180 las 190 

Val Ala Gly Lys Pro Ala Ala His He Ser Trp He Pro Glu Gly £so 
195 200 205 

Cys Ala Thr Lys Cln Glu Tyr Trp Ser Asn Gly Thr Val Thr Val Lvs 
210 215 220 

Ser Thr Cys His Trp Glu Val His Asn Val Ser Thr Val Asn Cys His 
225 230 235 240 

Val ser His Leu Thr Gly Asn Lys Ser Leu Tyr He Glu Leu Leu Pro 
245 250 255 

Val Pro Gly Ala Lys Lys Ser Ser Lys Leu Tyr He Pro Tyr He H e 
- 260 265 270 

Leu Thr He He He Leu Thr He Val Gly Xaa He Trp Leu Leu- Lys 
275 280 285 

Val Asn Gly Cys Xaa Lys Tyr Lys Leu Asn Lys Pro Glu Ser Th- Pro 
290 295 300 

Val Val Glu Glu Asp Glu Met Gin Pro Tyr Ala Phe Tyr Thr Glu Lvs 
305 310 315 120 



Asn Asn Pro Leu Xaa Xaa Thr Thr Asn Lys Val Lys Ala Ser Glu Ala 
325 330 335 

Leu Gin Ser Glu Val Asp Thr Asp Leu His Thr Leu Xaa 
340 345 



<210> 237 
<211> 17 
<212> PRT 
<213> Homo 



sapiens 
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<40O> 237 

Leu Ala Leu Tyr Ser Ala Leu Phe 
1 5 

Asn 



Ser Tyr Ser Gly Trp Asp Thr Leu 
10 IS 



<210> 238 
<211> 14 
<212> PRT 

<213> Homo sapiens 
, <400> 238 

; Val Thr Glu Glu He Lys Asn Pro Glu Arg Asn Leu Pro Leu 
1 • 5 10 

<210> 239 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 239 

lie Gly He Ser Met Pro He Val Thr 
1 5 

<210> 240 

<21l> 13 

<212> PRT 

<213> Homo sapiens 

<400> 240 

He Tyr lie Leu Thr Asn Val Ala Tyr Tyr Thr Val Leu 
1 5 10 



<210> 241 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Ser Asp Ala Val Ala Val Thr Phe Ala Asp Gin 
15 10 



<210> 242 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Val Ala Leu Ser Cys Phe Gly Gly Leu Asn Ala Ser He 
1 5 10 
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<210> 243 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 243 

Ser Arg Leu Phe Phe Val Gly Ser Arg Glu Gly His Leu Pro Asp 
1 5 10 is 

<210> 244 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 244 

Ser Phe Ser Tyr Trp Phe vPhe Val G!.y Leu Ser 
L 5 r 10 



<210> 245 - 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<40O> 245 

Val Gly Gin Leu Tyr Leu Arg Trp Lys Glu Pro 
15 10 



<210> 246 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 246 

Arg Pro Arg Pro Leu Lys :eu Ser Val Phe Phe Pro He val Phe Cys 
1 5 10 15 



<210> 247 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 247 

Asp Thr He Asn Ser Leu He Gly He 
1 5 



<210> 248 

<211> 44 

<212> PRT 

<213> Komo sapiens 

<400> 248 
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AX a Thr Ala Leu 
1 

Gin Val Leu cys 
20 

Gly Glu Met Pro 
35 



Pro Pro Lys tie 
5 

Met Ser Val Ala 

Lys Gin Arg Asp 
40 



Val Gly Ser Ala 
10 



Ala Glu Met Asp 
25 



Pro Lys Ser Asn 



Thr Arg Tyr Leu 
15 

Leu Glu Asp Gly 
30 



<210> 249 
<211> 352 
<212> PRT 

<213> Horao sapiens 



<400> 249 

Leu Leu Ala Ala Ala Cys,, He Cys Leu Leu Thr Phe He Asn Cys \Ma 
1 5 10 15 

Tyr Val Lys Trp Gly Thr Leu Val Gin Asp He Phe Thr Tyr Ala Lys 
20 25 30 

Val Leu Ala Leu He Ala Val He Val Ala Gly He VaL Arg Leu Gly 
35 40 45 

Gin Gly Ala Ser Thr His Phe Glu Asn Ser Phe Glu Gly Se- Ser Phe 
50 55 S0 

Ala Val Gly Asp He Ala Leu Ala Leu Tyr Ser Ala Leu Phe Ser Tvr 
65 70 75 80 

Ser Gly Trp Asp Thr Leu Asn Tyr Val Thr Glu Glu He Lys Asn Pro 
85 go 95 

Glu Arg Asn Leu Pro Leu Ser He Gly He Ser Met Pro He Val Thr 
i-OO 105 no 

Tie He Tyr Tie Leu Thr Asn Val Ala Tyr Tyr Thr Val Leu Asp Met 
115 120 125 

Arg Asp He Leu Ala Ser Asp Ala Val Ala Val Thr Phe Ala Asp Gin 
130 135 140 

He Phe Gly He Phe Asn Trp He He Pro Leu Ser Val Ala Leu Ser ■ 
145 150 155 160 

Cys Phe Gly Gly Leu Asn Ala Ser He Val Ala Ala Ser Arg Leu Phe 
iSS 170 175 

Phe val Gly Ser Arg Glu Gly His Leu Pro Asn Ala He Cys Met He 
180 185 " 190 

Kis Val Glu Arg Phe Thr Pro Val Pro Ser Leu Leu Phe Asn Gly He 
L95 200 2C5 

Met Ala Leu He Tyr Leu Cys Val Glu Asp He Phe Gin Leu He Asn 
210 215 220 

Tyr Tyr Ser Phe Ser Tyr Trp Phe Phe Val Gly Leu Ser He Val Gly 
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225 230 235 240 

Gin Leu Tyr Leu Arg Trp Lys Glu Pro Asp Arg Pro Axg Pro Leu Lys 
245 250 255 

Leu Ser Val Phe Phe Pro He Val Phe Cys Leu Cys Thr He Pip L*u 
260 265 270 

Val Ala Val Pro Leu Tyr Ser Asp Thr He Asn Ser Leu He Gly He 
275 280 295 

Ala He Ala Leu Ser Gly Leu Pro Phe Tyr Phe Leu lie He Ara Val 
290 295 300 

> ™ G1U HiS LyS Pro Leu **** Leu Ar 9 Axg He Val Gly Ser Ala 

4 305 310 315 320 

Thr Arg Tyr Leu Gin Val Leu Cys Met Ser Val Ala Ala Glu Met Asp 
325 330 335 

Leu Glu Asp Gly Gly Glu Met Pro Lys Gin Arg Asp Pro Lys Ser Asn 
340 345 350 



<210> 250 
<211> 119 
<212> PRT 

<213> Homo sapiens 
<400> 250 

Ala Ala Arg Gly Ser Gly Val Arg Asp Pro Leu Glu Glu Ala Val Cys 
1 5 io 15 

Pro Phe Ser Asp Leu Gin Leu His Ala Gly Arg Thr Thr Ala Leu Phe 
20 25 30 

Lys Ala Val Arg Gin Gly His Leu Ser Leu Gin Arg Leu Leu Leu Ser 
35 40 45 

Phe Val Cys Leu Cys Pro Ala Pro Arg Gly Gly Ala Tyr Arg Gly Arg 
50 55 60 

Gin Ala Ser Leu Ser Cys Gly Gly r.au His Pro Val f^g Ala Ser Ar<r 
65 ™ 75 80 

Leu Leu Cys Leu Pro Lys Gin Ala Trp Ala Met Ala Gly Ala Pro Pro 
65 90 95 

Pro Val Ser Leu Pro Pro Cys Ser Leu He Ser Asp Cys Cys Ala Ser 
100 105 110 

Asn Gin Arg Asp Ser Val Gly 
115 



<210> 251 
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<2ll> 356 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (37) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (280) 

<223> Xaa equals any of the naturally occurring L- amino acids 
<400> 251 

Leu Ser Lys Ala Phe Leu Asp Ser Pro Asn Arg Leu Leu Ala Val Glu 
1 5 i 10 15 

Met Asn Thr Asp His Leu Arg Leu Thr Val Fro Asn Gly lie Gly Ala 
20 25 30 

Leu Lys Leu Arg Xaa Met Glu Kis Tyr Phe Ser Gin Gly Leu Ser Val 
35 40 45 

Gin Leu Phe Asn Asp Gly Ser Lys Gly Lys Leu Asn His Leu Cys Glv 
50 55 60 

Ala Asp Phe Val Lys Ser His Gin Lys Pro Pro Gin Gly Met Glu lie 
65 70 75 80 

Lys Ser Asn Glu Arg Cys Cys Ser Phe Asp Gly Asp Ala Asp Arg lie 
85 90 95 

Val Tyr Tyr Tyr His Asp Ala Asp Gly His Phe His Leu He Asp Gly 
100 105 HO 

Asp Lys He Ala Thr Leu He Ser Ser Phe Leu Lys Glu ^eu Leu Val 
US 120 125 

Glu lie Gly Glu Ser Leu Asn lie Gly Val Val Gin Thr Ala Tyr Ala 
130 135 140 

Asn Gly ser Ser Thr Arg Tyr Leu Glu Glu Val Met Lys Val Pro Val 
145 150 155 i 6 o 

Tyr Cys Thr Lys Thr Gly Val Lys His Leu His His Lys Ala Gin Glu 
165 170 175 

Phe Asp lie Gly Val Tyr Phe Glu Ala Asn Gly His Gly Thr Ala Leu 
180 185 190 

Phe Ser Thr Ala Val Glu Met Lys He Lys Gin Ser Ala Glu Glu Leu 
195 200 205 

Glu Asp Lys Lys Arg Lys Ala Ala Lys Met Leu Glu Asn He He Asp 
210 215 220 

Leu Phe Asn Gin Ala Ala Gly Asp Ala He Ser Asp Met Leu Val He 
225 230 235 240 
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Glu Ala lie Leu Ala Leu Lys Gly Leu Thr Val Gin Gin Trp Asp Ala 
245 250 255 

Leu Tyr Thr Asp Leu Pro Asn Arg Gin Leu Lys Val Gin Val Ala Asp 
260 265 270 

Arg Arg Val He Ser Thr Thr Xaa Ala Glu Arg Gin Ala Val Thr Pro 
275 280 285 

Pro Gly Leu Gin Glu Ala He Asn Asp Leu Val Lys Lys Tyr Lys Leu 
290 295 300 

Ser Arg Ala Phe Val Arg Pro Ser Gly Thr Glu Asp Val Val Arg Val 
305 310 315 320 

Tyr Ala Glu Ala Asp Serein Glu Ser Ala Asp His Leu Ala His Glu 
325 330 335 

Val Ser Leu Ala Val Phe Gin Leu Ala Gly Gly tie Gly Glu Arg Pro 
340 345 350 

Gin Pro Gly Phe 
355 



<210> 252 
<2ll> 26 
<212> PRT 

<213> Homo sapiens 
<400> 252 

Leu Ser Lys Ala Phe Leu Asp Ser 
1 5 

Met Asn Thr Asp His Leu Arg Leu 
20 



Pro Asn Arc Leu Leu Ala Val Glu 
10 is 

Thr Val 
2S 



<210> 253 
<211> 28 
<212> PRT 

< 213> Homo sapiens 
<220> 

<221> SITE 
<222> (11) 

<223> Xaa equals any of the naturally occurring L-anino acids 
<400> 253 

Pro Asn Gly He Gly Ala Leu Lys Leu Arg Xaa Met Glu His Tyr Phe 
1 5 io 15 

Ser Gin Gly Leu Ser Val Gin Leu Phe Asn Asp Gly 
20 25 



<210> 254 
<211> 28 
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<212> PRT 

<213> Homo sapiens 



<400> 254 

Ser Lys Gly Lys Leu Asn His Leu Cys Gly Ala Asp Phe Val Lys Ser 
1 5 io 15 

His Gin Lys Pro Pro Gin Gly Mat Glu lie Lys Ser 
20 25 



<210> 255 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 255 

Asn Glu Axg Cys Cys Ser Phe Asp Gly Asp Ala Asp Arg lie Val Tyr 
1 , 5 10 15 

Tyr Tyr His Asp Ala Asp Gly His Phe His Leu He 
20 25 



<210> 256 
<211> 2B 
<212> PRT 

<213> Homo sapiens 



<400> 256 

Asp Gly Asp Lys rie Ala Thr Leu He Ser Ser Phe Leu Lys Glu Leu 
1 5 10 15 

Leu Val Glu He Gly Glu Ser Leu Asn He Gly Val 
20 25 



<210> 257 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 257 

Val Gin Thr Ala Tyr Ala Ann Gly 
* 5 

Val Met Lys Val Pro Val Tyr Cys 
20 



Ser Ser Thr Arg Tyr Leu Glu Glu 
10 15 

Thr Lys Thr Gly 
25 



<210> 258 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 258 



a Lys His Leu His His Lys Ala Gin Glu Phe Asp He Gly Val Tyr 
1 5 io 1S 
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Phe Glu Ala Asn Gly His Gly Thr Ala Leu Phe Ser 
20 * 25 



<210> 259 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 259 

Thr Ala Val Glu Met Lys Tie Lys Gin Ser Ala Glu Gin Leu Glu Asp 
1 5 io 15 

Lys Lys Arg Lys Ala Ala Lys Met Leu. Glu Asn lie 
20 25 



<210> 260 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 260 

lie Asp Leu Phe Asn Gin Ala Ala Gly Asp Ala He Ser Asp Met Leu 
15 10 15 

Val lie Glu Ala He Leu Ala Leu Lys Gly Leu Thr 
20 25 



<210> 2S1 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 261 

Val Gin Gin Trp Asp Ala Leu Tyr Thr Asp Leu Pro Asn Arg Gin Leu 
1 5 io 15 

Lys Val Gin Val Ala Asp Arg Arg Val He Ser Thr 
20 25 



<210> 262 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2) 

<223> Kaa equals any of the naturally occurring L- amino acids 
<400> 262 

Thr Xaa Ala Glu Arg Gin Ala Val Thr Pro Pro Gly Leu Gin Glu Ala 
15 io IS 



lie Asn 'Asp Leu Val Lys Lys Tyr Lys Leu Ser Arg 
20 25 
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<210> 263 
<2ll> 24 
<212> PRT 

<213> Homo sapiens 
<400> 263 

Ala Phe Val Arg Pro Ser Gly Thr Glu Asp val val Arg Val Tyr Ala 
1 5 io 15 

Glu Ala Asp Ser Gin Glu Ser Ala 
20 



<210> 264 

<211> 26 r- 

<212> PRT 

<213> Hocio sapiens 

<400> 264 

Asp His Leu Ala His Glu Val Ser Leu Ala Val Phe Gin Leu Ala Gly 
15 10 15 

Gly lie Gly Glu Arg Pro Gin Pro Gly Phe 
20 25 



<210> 265 
<211> 443 
<212> PRT 
<213> Homo sapiens 

<400> 265 

Gly Thr Arg Ala Ala Pro Gly Leu Gly Ala Trp Gly Arg Arg Ser Pro 
1 5 10 is 

Pro Ser Phe Ser Pro Pro Arg Pro Arg Arq Pre Gly Val Met Ala Gly 
20 25 30 

Leu Asn Cys Gly Val Ser He Ala Leu Leu Gly Val Leu Leu Leu Gly 
35 40 45 

Ala Ala Arg Leu pro Arg Gly Ala Glu Ala Phe Glu He Ala Leu Pro 
50 55 60 

Arg Glu Ser Asn He Thr Val Leu He Lys Leu Gly Thr Pro Thr Leu 
65 70 75 bo 

Leu Ala Lys Pro Cys Tyr lie Val He Ser Lys Arg His lie Thr Met 
85 90 95 

Leu Ser He Lys Ser Gly Glu Arg He Val Phe Thr Phe Ser Cys Gin 
100 105 no 

Ser Pro Gin Asn His Phe Val He Glu He Gin Lys Aan He Asp Cys 
115 120 125 

Met ser Gly Pro Cys Pro Phe Gly Glu Val Gin Leu Gin Pro Ser Thr 
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130 135 140 

Ser Leu Leu Pro Thr Leu Asn Arg Thr Phe He Trp Aso Val Lys Ala 
14S 155 " ¥ 160 

His Lys Ser lie Gly Leu Glu Leu Gin Phe Ser Tie Pro Arg Leu Arg 
165 170 175 

Gin He Gly Pro Gly Glu Ser Cys Pro Asp Gly Val Thr His Ser lie 
180 185 190 

Ser Gly Arg He Asp Ala Thr Val VaL Arg He Gly Thr Phe Oys Ser 
195 200 205 

Asn Gly Thr Val Ser Arg He Lys Met Gin Glu Gly Val Lys Met Ala 
210 215 220 

Leu His Leu Pro Trp Phe His Pro Arg Asn Val Ser Gly Phe Ser ~le 
225 230 235 240 

Ala Asn Arg Ser Ser He Lys Arg Leu Cys He He Glu Ser Val Phe 
245 250 255 

Glu Gly Glu Gly Ser Ala Thr Leu Met Ser Ala Asn Tyr Pro Glu Gly 
260 265 270 

Phe Pro Glu Asp Glu Leu Met Thr Trp Gin Phe Val Val Pro Ala His 
275 280 285 

Leu Arg Ala Ser Val Ser Phe Leu Asn Phe Asn Leu Ser Asn Cys Glu 
290 295 300 

Arg Lys Glu Glu Arg Val Glu Tyr Tyr He Pro Gly Ser Thr Thr Asn 
305 315 320 

Pro Glu Val Phe Lys Leu Glu Asp Lys Gin Pro Gly Asn Met Ala Gly 
325 330 335 

Asn Phe Asn Leu Ser Leu Gin Gly Cys Asp Gin Asp Ala Gin Ser Pro 
340 345 35o 

Gly He Leu Arg Leu Gin Phe Gin Val Leu Val Gin His Pro Gin Asn 
355 360 365 

Glu Ser Asn Lys He Tyr Val Val Asp Leu Ser Asn Glu Arg Ala Met 
370 375 380 

Ser Leu Thr He Glu Pro Arg Pro Val Lys Gin Ser Arg Lys Phe Val 
385 390 395 400 

Pro Gly Cys Phe Val Cys Leu Glu Ser Arg Thr Cys Scr Ser Asn Leu 



Thr Leu Thr Ser Gly Ser Lys His Lys He Ser Phe Leu Cys Asp Asp 
420 425 430 

Leu Thr Arg Leu Trp Met Asn Val Glu Lys Pro 
435 440 
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<210> 266 

<211> 159 

<212> PRT 

<213> Homo sapiens 

<400> 266 

Phe Glu lie Ala Leu Pro Arg Glu Ser Asa lie Thr Val Leu He Lys 
1 5 io 15 

Leu Gly Thr Pro Thr Leu Leu Ala Lys Pro Cys Tyr He Val He Ser 
20 2S 30 

Lys Arg His He Thr Met Leu Ser lie. Lys Ser Gly Glu Arg He Val 
35 40 45 

Phe Thr Phe Ser Cys GXp Ser Pro Glu Asn His Phe Val He Glu He 
50 55 60 

Gin Lys Asn He Asp Cys Met Ser Gly Pro Cys Pro Phe Gly Glu Val 
65 7 ° 75 80 

Gin Leu Gin Pro Ser Thr Ser Leu Leu Pro Thr Leu Asn Arg Thr Phe 
85 go 95 

He Trp Asp Val Lys Ala His Lys Ser He Gly Leu Glu Leu Gin Phe 
100 105 no 

Ser He Pro Arg Leu Arg Gin He Gly Pro Gly Glu Ser Cys Pro Asp 
1*5 120 125 

Gly Val Thr His Ser He Ser Gly Arg He Asp Ala Thr Val Val Arg 
"0 135 i 4 o 

He Gly Thr Phe Cys Ser Asn Gly Thr Val Ser Arg He Lys Met 
145 150 155 



<210> 267 
<211> 9 
<212> PRT 

<213> Homo sapiens 
«400> 267 

Phe Val Arg Asp Pro Phe Val Arg Leu 
1 5 



<210> 268 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 268 

Phe Leu Phe Val Arg Asp Pro Phe Val Arg Leu He Ser 
1 5 io 



<210> 269 



WO 00/06698 



181 



PCT/US99/I7130 



<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 269 

Phe Leu Phe Val Arg Asp Pro Phe Val Arg Leu He Ser Ala Phe 



<210> 270 
<211> 380 
<212> prt 

<213> Homo sapiens 
<400> 270 

Tyr Leu His Thr Ser Phe Ser Arg Pro His Thr Gly Pro Pro Leu Pro 
1 5 y 10 15 

Thr Pro Gly Pro Asp Arg Asp Arg Glu Leu Thr Ala Asp Ser Asp Val 
20 25 30 

Asp Glu Phe Leu Asp Lys Phe Leu Ser Ala Gly Val Lys Gin Ser Asp 
35 40 45 ' 

Leu Pro Arg Lys Glu Thr Glu Gin Pro Pro Ala Pro Gly Ser Met Glu 
50 S5 60 

Glu Asn Val Arg Gly Tyr Asp Trp Ser Pro Arg Asp Ala Arg Arg Ser 
65 70 75 80 

Pro Asp Gin Gly Arg Gin Gin Ala Glu Arg Axg Ser Val Leu Arg Gly 
85 go 35 

Phe Cys Ala Asn Ser Ser Leu Ala Phe Pro Thr Lys Glu Arg Ala Phe 
100 105 no 

Asp Asp He Pro Asn Ser Glu Leu Ser His Leu He Val Asp Asp Arg 
US 120 125 

His Gly Ala He Tyr Cys Tyr Val Pro Lys Val Ala Cys Thr Asn Trp 
130 135 140 

Lys Arg Val Met He Val Leu Ser Gly Ser Leu Leu His Arg Gly Ala 
145 I 50 155 i 6 o 

Pro Tyr Arg Asp Pro Leu Arg He Pro Arg Giu His Val His Asn Ala 
165 170 175 

Ser Ala His Leu Thr Phe Asn Lys Phe Trp Arg Arg Tyr Gly Lys Leu 
180 185 190 

Ser Arg His Leu Ker Lys Val Lys Leu Lys Lys Tyr Thr Lys Phe Leu 
195 200 205 

Phe Val Arg Asp Pro Phe Val Arg Leu lie Ser Ala Phe Arg Ser Lys 
210 215 220 

Phe Glu Leu Glu Asn Glu Glu Phe Tyr Arg Lys Phe Ala Val Pro Met 
225 230 235 240 
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Leu Arg Leu Tyr Ala Asn His Thr Ser Leu Pro Ala Ser Ala Arg Glu 
245 250 255 

Ala Phe Arg Ala Gly Leu Lys Val Ser Phe Ala Asn Phe lie Gin Tyr 
260 265 270 

Leu Leu Asp Pro His Thr Glu Lys Leu Ala Pro Phe Asn Glu His Trp 
27 5 280 2B5 

Arg Gin Val Tyr Arg Leu Cys His Pro Cys Gin lie Asp Tyr Asp Phe 
290 295 300 

Val Gly Lys Leu Glu Thr Leu Asp Glu. Asp Ala Ala Gin Leu Leu Gin 
305 310 315 320 

Leu Leu Gin Val Asp Arg ^Gln Leu Arg Phe Pro Pro Ser Tyr Arg Asn 
325 330 335 

Arg Thr Ala Ser Ser Trp Glu Glu Asp Trp Phe Ala Lys Tie Pro leu 
340 345 350 

Ala Trp Arg Gin Gin Leu Tyr Lys Leu Tyr Glu Ala Asp Phe Val Leu 
355 360 365 

Phe Gly Tyr Pro Lys Pro Glu Asn Leu Leu Arg Asp 
370 375 380 

<210> 271 
<211> 274 
<212> PRT 
<213* Homo sapiens 

<400> 271 

Lys Leu Val Arg Leu Gin Val Pro Val Arg Asn Ser Arg Val Asp Pro 
1 5 io 15 

Arg Val Arg ser Lys He Gly Ser Arg Arg Trp Met Leu Gin Leu lie 
20 25 30 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly Cys 
35 40 4 S 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
50 55 60 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
65 70 75 80 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
85 90 95 

Leu Gly Ala Ala Leu Gin Leu Ala lie Ser Thr Tyr Ala Ala Tyr He 
100 105 no 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr- Ser 
H5 12Q 12 5 
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Arg Thr Val Ala He He Gly Giy Phe Leu Val Leu Ala Ser Gly Ala 
130 135 140 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
145 150 155 160 

Gly Gin Val Phe Leu Gly He Tyr Leu lie Cys Val Ala Tyr Ser Leu 
16S 170 175 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
180 185 190 

Gly Glu Leu Met lie Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 
195 200 205 

Leu Ala Phe Leu Ser Gly Tyr Tyr Val Thr Leu Ala Ala Gin He Leu 
210 /215 220 

Ala Val Leu Leu Pro Pro Val Met Leu Leu He Asp Gly Asn Val Ala 
225 230 235 240 

Tyr Trp His Asn Thr Arg Arg Val Glu Phe Trp Asn Gin Met Lys Leu 
245 250 255 

Leu Gly Glu Ser Val Gly He Phe Gly Thr Ala Val He Leu Ala Thr 
260 265 270 

Asp Gly 



<210> 272 
<211> 203 
<212> PRT 

<213> Homo sapiens 
<400> 272 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly Cys 
1 5 10 15 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 
20 25 30 

Pro Asp He Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
35 40 45 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 55 60 

Leu Gly Ala Ala Leu Gin Leu Ala He Ser Thr Tyr Ala Ala Tyr He 
65 70 75 80 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 
85 90 95 

Arg Thr Val Ala He He Gly Cly Phe Leu Val Leu Ala Ser Gly Ala 
100 105 HO 

Gly Glu Leu T/r Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
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U5 120 125 

Gly Gin Val Phe Leu Gly He Tyr Leu He Cys Val Ala Tyr Ser Leu 
130 135 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
145 150 155 i 6 o 

Gly Glu Leu Met He Gin Leu Phe Phe val Leu Tyr Gly He Leu Ala 
155 170 175 

Pro Gly Leu Ser Val Arg Leu Leu Arg Asp Pro Arg Cys Pro Asp Pro 
180 185 190 

Gly Cys. Thr Ala Ala Pro Cys His Ala Ala His 
195 200 



<210> 273 
<211> 407 
<212> PRT 
<213> Homo sapiens 

<400> 273 

Ser Asri Glu He Leu Leu Ser Phe Pro Gin Asn Tyr Tyr He Gin Trp 
15 10 15 

Leu Asn Gly Ser Leu He His Gly Leu Trp Asn Leu Ala Ser Leu Phe 
20 25 30 

Ser Asn Leu Cys Leu Phe Val Leu Ket Pro Phe Ala Phe Phe Pha Lou 
35 40 45 

Glu Ser Glu Gly Phe Ala Gly Leu Lys Lys Gly lie Arg Ala Arg He 
50 55 60 

Leu Glu Thr Leu Val Mat Leu Leu Leu Leu Ala Leu Leu He Leu Gly 
65 ™ 75 80 

He Val Trp Val Ala Ser Ala Leu He Asp Asr. Asp Ala Aia Ser Met 
85 90 95 

Glu Ser Leu Tyr Asp Leu Trp Glu Phe Tyr Leu Pro Tyr Leu Tyr Ser 
1°° 105 . no 

Cys He Ser Leu Met Gly Cys Leu Leu Leu Leu Leu Cys Thr Pro Val 
115 120 125 

Gly Leu ser Arg Met Phe Thr Val Met Gly His Leu Leu Val Lys Pro 
130 135 140 

Thr He Lou Glu Asp Leu Asp Glu Gin He Tyr He He Thr Leu G^u 
145 150 155 iio 

Glu Glu Ala Leu Gin Arg Arg Leu Asn Gly Leu Ser Ser Ser Va' Glu 
1SS 170 175 

Tyr Asn He Met Glu Leu Glu Gin Glu Leu Glu Asn Val Lys Thr Leu 
180 185 190 
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Lys Thr Lys Leu Glu Arg Arg Lys Lys Ala Ser Ala Trp Glu Arg Asn 
195 200 205 

Leu Val Tyr Pro Ala Val Met Val Leu Leu Leu lie Glu Thr Ser He 
210 215 220 

Ser Val Leu Leu Vol Ala Cys Asn He Leu Cys Leu Leu Val Asp Glu 
225 230 235 240 

Thr Ala Met Pro Lys Gly Thr Arg Gly Pro Gly He Gly Asn Ala Ser 
245 250 255 

Leu ser Thr Phe Gly Phe Val Gly Ala . Ala Leu Glu He He Leu He 
260 265 270 

Phe Tyr Leu Met Val Ser Ser Val Val Gly Phe Tyr Ser Leu Arg Phe 
275 280 285 

Phe Gly Asn Phe Thr Pro Lys Lys Asp Asp Thr Thr Met Thr T.ys He 
290 295 300 

He Gly Asn Cys Val Ser He Leu Val Leu Ser Ser Ala Leu Pro Val 
305 310 315 320 

Met Ser Arg Thr Leu Gly He Thr Arg Phe Asp Leu Leu Gly Asp Phe 
325 330 335 

Gly Arg Phe Asn Trp Leu Gly Asn Phe Tyr He Val Leu Ser Tyr Asn 
340 345 350 

Leu Leu Phe Ala He Val Thr Thr Leu Cys Leu Val Arg Lys Phe Thr 
355 360 365 

Ser Ala Val Arg Glu Glu Leu Phe Lys Ala Leu Gly Leu His Lys Leu 
370 375 380 

His Leu Pro Asn Thr Ser Arg Asp Ser Glu Thr Ala Lys Pro Ser Val 
385 390 395 400 

Asn Gly His Gin Lys Ala Leu 
405 



<210> 274 
<211> 165 
<212> PRT 

<213> Homo sapiens 
<400> 274 

Arg Ser Tyr Met Gin Ser Val Trp 
1 5 

Leu Asn Ala Ser He Thr Glu Thr 
20 



*?hr Glu Glu Ser Gin Cys Thr Leu 
10 15 

Phe Asn Cys Ser Phe Ser Cys Gly 
25 30 



Asp Cys Trp Lys Leu Ser Gin Tyr Pro Cys Leu Gin Val Tyr Val 
35 40 ' 45 
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Asn Leu Thr Ser Ser Gly Glu Lys Leu Leu Leu Tyr His Thr Glu Glu 
50 55 60 

Thr He Lys He Asn Gin Lys Cys Ser Tyr He Pro Lys Cys Gly Lys 
65 70 75 go 

Asn Phe Glu Glu Ser Met Ser Leu Val Asn Val Val Met Glu Asn Phe 
85 90 . 95 

Arg Lys Tyr Gin His Phe Ser Cys Tyr Ser Asp Pro Glu Gly Asn 31 n 
100 105 no 

Lys Ser Val He Leu Thr Lys Leu Tyr Ser Ser Asn Val Leu Phe His 
115 120 125 

Ser Leu Phe Trp Pro Thr Cys Met Met Ala Gly Gly Val Ala He Val 
130 140 

Ala Met Val Lys Leu Thr Gin Tyr Leu Ser Leu Leu Cys Glu Arg He 
145 ISO 155 160 

Gin Axg He Asn Arg 
165 



<210> 275 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<40O> 275 

Ala Phe Ala His Leu Gin Leu Gly Pro Met Trp Lys Leu Trp Arg Ala 
1 5 10 15 

Glu Glu Gly Ala Ala Ala Leu Gly Gly Ala Leu Phe Leu Leu Leu Phe 
20 25 30 

Ala Leu Gly Val Arg Gin Leu Leu Lys Gin Arg Arg Pro Met Gly Phe 
35 40 45 

Pro Pro Gly Pro Pro Gly Leu Pro Phe He Gly Asn He Tyr Ser Leu 
50 55 60 

Ala Ala Ser Ser Glu Leu Pro His Val Tyr Met Arg Lys Gin Ser Gin 
65 70 75 80 

Val Tyr Gly Glu Val Gin Pro Arg Arg Ala Pro Gly Arg Glu Gly Arg 
85 90 95 

Gin Ala Gly Pro Gly Trp Pro Gly Pro Ser Trp Leu Asp Leu Trp Pro 
100 105 HO 

Pro Leu Gly Arg Leu Val Gly Thr Ser Pro Cys Ala Gly Cys Pro Leu 
115 120 125 

Arg Aap Thr Arg Phe Pro Gly Leu Glu Gly Arg Ser Pro Arq Ara Ara 
130 135 140 

Ala Pro Leu Gin Gly Glu Pro Arg Pro Cys Arg 
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145 150 155 



<210> 276 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 276 

Met Arg Val Arg lie Gly Leu Thr Leu Leu Leu Cys Ala Val Leu Leu 
15 10 15 

Ser Leu Ala Ser Ala Ser Ser Asp Glu Glu Gly Ser Gin Asp Glu Ser 
20 25- 30 

Leu Gly Pha Gin Asp Tyr Phe Asp lie Arg 
35 *■ 40 



<21G> 277 
<211> 155 
<212> PRT 
<213> Homo sapiens 

<400> 277 

Met Ala Arg Gly Ser Leu Arg Arg Leu Leu Arg Leu Leu Val Leu Gly 
15 10 15 

Leu Trp Leu Ala Leu Leu Arg Ser Val Ala Gly Glu Gin Ala Pro Gly 
20 25 30 

Thr Ala Pro Cys Ser Arg Gly Ser Ser Trp Ser Ala Asp Leu Asp Lys 
35 40 45 

Cys Met Asp Cys Ser Thr Ser Cys Pro Leu Pro Ala Ala Leu Ala s 
50 55 €0 

Pro Trp Gly Arg Ser Glu Pro Asp Leu Arg Ala Gly Ala Ala Phe Trp 
65 70 75 80 

Leu Phe Gly Leu Glu Thr Met Pro Gin Glu Arg Glu Val His His Pro 
85 90 95 

His Arg Gly Asp Arg Arg Arg Gly Leu Pro Ser Cys Gly Ala Asp Pro 
100 105 no 

Val Thr Met Cj's Pro Leu Pro Ala Gly Ala Arg Pro Leu lie lie His 
115 120 125 

Ser Ser He Leu Glu Pro Val Ser Ala Ser Gin Thr Arg Arg Glu Pro 
130 135 140 

Ser Ser Ser Asn His Lys Gly Gly Gly Gly Arc 
145 iso 155 



<210> 278 
<211> 207 
<212> PRT 
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<213> Homo sapiens 
<400> 278 

Gly Thr Ser Phe Leu Asp Pro Thr Leu Ser Leu Phe Val Leu Glu Lys 
15 io is 

Phe Asn Leu Pro Ala Gly Tyr Val Gly Leu Val Phe Leu Gly Met Ala 
20 25 30 

Leu Ser Tyr Ala lie Ser Ser Pro Leu Phe Gly Leu Leu Ser Asp Lys 
35 40 45 

Arg Pro Pro Leu Arg Lys Trp Leu Leu Val Phe Gly Asn Leu lie Thr 
50 55 60 

Ala Gly Cys Tyr Met Leu Leu Gly Pro Val Pro lie Leu His lie Lys 
65 70/ 75 80 

Ser Gin Leu Trp Leu Leu Val Leu He Leu Val Val Ser Gly Leu Ser 
85 90 95 

Ala Gly Met Ser Tie lie Pro Thr Phe Pro Glu He Leu Ser Cys Ala 
100 105 1L0 

His Glu Asn Gly Phe Glu Glu Gly Leu Ser Thr Leu Gly Leu Val Ser 
115 120 125 

Gly Leu Phe Ser Ala Met Trp Ser He Gly Ala Phe Met G"y Pro Thr 
130 135 140 

Leu Gly Gly Phe Leu Tyr Glu Lys He Gly Phe Glu Trp Ala Ala Ala 
H5 150 155 160 

He Gin Gly Leu Trp Ala Leu He Ser Gly Leu Ala Met Gly Leu Phe 
165 170 175 

Tyr Leu Leu Glu Tyr Ser Arg Arg Lys Arg Ser Lys Ser Gin Asn He 
180 185 190 

Leu Ser Thr Glu Glu Glu Arg Thr Thr Leu Leu Pro Asn Glu Thr 
195 200 205 



<210> 279 
<211> 85 
<212> prt 

<213> Homo sapiens 
<400> 279 

Gly Thr Arg Glu Ala Arg Leu Arg Asp Leu Thr Arg Phe Tyr Asp Lys 
IS 10 is 

Val Leu Ser Leu His Glu Asp Ser Thr Thr Pro Val Ala Asn Pro Leu 
20 25 30 

Leu Ala Phe Thr Leu He Lys Arg Leu Gin Ser Asp Trp Arg Asn Vel 
35 40 45 

Val His Ser Leu Glu Ala Ser Glu Asn He Arg Ala Leu Lys Asp Gly 
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50 



55 



60 



Tyr Glu Lys Val Glu Gin Asp Leu Pro Ala Phe Glu Asp Leu Glu Gly 
65 70 75 80 



Ala Ala Arg Ala Leu 
B5 



<210> 280 

<2U> 7 

<212> PRT 

<213> Homo sapiens 

<400> 280 

Ala Leu Met Arg Leu Gin Asp 
1 5 



<210> 281 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 281 

Val Glu Ala Gly Gly Ala Thr 
1 5 



<=210> 282 
<211> 489 
<212> PRT 

<213> Homo sapiens 
<400> 282 

Gly Thr Arg Glu Ala Arg Leu Arg Asp Leu Thr Arg Phe Tyr Asp Lys 
1 S io 15 

val Leu Ser Leu His Glu Asp Ser Thr Thr Pro Val Ala Asn Pro Leu 
20 25 30 

Lau Ala Phe Thr Leu lie Lys Arg Leu Gin Ser Asp Trp Arg Asn Val 
35 40 45 

Val His Ser Leu Glu Ala Ser Glu Asn He Arg Ala Leu Lys Asp Gly 
50 55 60 

Tyr Glu Lys Val Glu Gin Asp Leu Pro Ala Phe Glu Asp Leu Glu Gly 
65 70 75 80 

Ala Ala Arg Ala Leu Met Arg Leu Gin Asp Val Tyr Met Leu Asn Val 
85 90 95 

Lys Gly Leu Ala Arg Gly Val Phe Gin Arg Val Thr Gly Ser Ala lie 
100 105 no 

Thr Asp Leu Tyr Ser Pro Lys Arg Leu Phe Ser Leu Thr Gly Asp Asp 
115 120 125 
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Cys Phe Gin Val Gly Lys Val Ala Tvr Asp Het Gly Asp Tyr Tyr His 
130 135 140 

Ala lie Pro Trp Leu Glu Glu Ala Val Ser Leu Phe Arg Gly Ser Tyr 
145 150 155 160 

Gly Glu Trp Lys Thr Glu Asp Glu Ala Ser Leu Glu Asn Ala Leu Asp 
165 170 " 175 

His Leu Ala Phe Ala Tyr Phe Arg Ala Gly Asn Val Ser Cys Ala Leu 
180 185 190 

Ser Leu Ser Arg Glu Phe Leu Leu Tyr Ser Pro Asp Asn Lys Arg Met 
195 200 205 

Ala Arg Asn Val Leu Lys Tyr Glu Arg Leu Lou Ala Glu Ser Pro Asn 
210 /215 220 

His Val Val Ala Glu Ala Val He Gin Arg Pro Asn He Pro His Leu 
225 230 235 240 

Gin Thr Arg Asp Thr Tyr Glu Gly Leu Cys Gin Thr Leu Gly Ser Gin 
245 250 255 

Pro Thr Leu Tyr Gin He Pro Ser Leu Tyr Cys Ser Tyr Glu Thr Asn 
260 265 270 

Ser Asn Ala Tyr Leu Leu Leu Gin Pro He Arg Lys Glu Val He His 
275 280 285 

Leu Gl:i Pro Tyr He Ala Leu Tyr His Asp Phe Val Scr Asp Ser Glu 
290 295 300 

Ala Gin Lys He Arg Glu Leu Ala Glu Pro Trp Leu Gin Arg Ser Va"t 
3 °5 310 315 320 

Val Ala Ser Gly Glu Lys Gin Leu Gin Val Glu Tyr Arg He Ser Lys 
325 330 335 

Ser Ala Trp Leu Lys Asp Thr Val Asp Leu Lys Leu Val Thr Leu Asn 
340 345 

His Arg He Ala Ala Leu Thr Gly Leu Asp Val Arg Pro Pro Tyr Ala 
35 5 360 365 

Glu Tyr Leu Gin Val Val Asn Tyr Gly He Gly Cly His Tyr Glu Pro 
370 375 380 

His Phe Asp His Ala Thr Ser Pro Ser Ser Pro Leu Tyr Arg Met Lys 
385 390 395 400 

Ser Gly Asn Arg Val Ala Thr Phe Ket He Tyr Leu Ser Ser Val Glu 
405 410 415 

Ala GLy Gly Ala Thr Ala Phe He Tyr Ala Asn Leu Ser Val Pro Val 
420 425 430 

Val Arg Asn Ala Ala Leu Phe Trp Trp Asn Leu His Arg Ser Gly Glu 
435 440 445 
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Gly Asp Ser Asp Thr Leu His Ala Gly Cys Pro Val Leu Val Gly Asd 
450 455 460 

Lys Trp Val Ala Asn Lys Trp He His Glu Tyr Gly Gin Glu Phe Ara 

465 470 475 480 

Arg Pro Cys Ser Ser Ser Pro Glu Asp 
485 



<210> 283 
<211> 136 
<212> PRT 

<2 1 3 > Homo sapiens 
<400> 283 

lie Gin Pro Ser His Ala" Ala Leu Leu His Cys Arg Ser Thr Phe Arg 
15 10 15 

Lys Thr Glu Cys Leu Asp Pro Trp Trp Val Arg Arg Gin Leu Leu Gly 
20 25 30 

Met Ala Gly lie Gly Gly Leu Gin Lys Met Lys Ala Pro His Thr Gly 
35 40 45 

Val Leu His Leu Gly Ser Val Trp Val The Leu Gly Tro Phe Leu Leu 
50 S5 6 o 

Gly Val Gly Tyr Thr Leu Thr Phe Asn Pro Leu Ser Gly Cys Met Ser 
65 7 0 75 80 

Thr Val Arg Trp Leu Asn Ser Asn He Thr Ala Asn Arg Thr Leu Ser 
85 go 95 

Arg Ser Val Cys His Val Thr Pro Leu His Arg Ser Leu Ser Pro His 
100 . 105 HO 

Asp Gly Glu Tyr Leu Arg C-ln Met Leu Leu Asn Ser Ser Ser Arg Ala 
115 120 125 

Gly Glu Ala Gly Ser Trp Gly Tyr 
130 135 



<210> 284 
<211> A6 
<212> PRT 

<213> Homo sapiens 



<400> 284 

Cys Ser Ser Pro Pro Gly Arg Leu 
1 5 

Thr Leu Gly Lys His Gly Ser Leu 
20 

Pro Leu His Leu Aia Trp Lys Met 

35 40 



Pro Trp Cys Trp Thr Ala Pro Arg 
10 15 

lie Ser Thr Leu Arg Leu Thr Ala 
25 30 

Met Leu Ser Arg Lys Ala Leu Phe 
45 
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Val Leu Leu Asn Thr Pro Val Leu Phe His Ala Leu Glu Gly Arg Leu 
50 55 60 

Phe Ser Lys Leu Cys His His His Thr lie Gin Arg Thr Leu Thr Val 
65 70 75 80 

Pro Lys Phe Arg Ser Ser 
85 



<210> 285 
<2U> 75 
<212> PRT 

<213> Homo sapiens 



<400> 285 j. 



Arg Ser Pro Thr Ser Arg Val Gin Leu Leu Lys Arg Gin Ser Cys Pro 
1 5 10 15 

Cys Gin Arg Asn Asp Leu Asn Glu Glu Pro Gin His Phe Thr His Tyr 
20 25 30 

Ala lie Tyr Asp Phe lie Val Lys Gly Ser Cys Phe Cys Asn Gly H-s 
35 40 45 

Ala Asp Gin Cys lie Pro Val His Gly Phe Arg Pro Val Lys Ala Pro 
50 55 60 

Gly Thr Phe His Met Val His Gly Lys Cys Met 
65 70 75 



<210> 286 
<211> 296 
<212> PRT 

<213> Homo sapiens 
<400> 286 

His Asn Thr Ala Gly Ser His Cys Gin His Cys Ala Pro Leu Tyr Asn 
1 5 io 15 

Asp Arg Pro Trp Glu Ala Ala Asp Gly Lys Thr Gly Ala Pro Asn Glu 
20 25 30 

Cys Arg Thr Cys Lys Cys Asn Gly His Ala Asp Thr Cys His Phe Asp 
35 40 45 

Val Asn Val Trp Glu Ala Ser Gly Asn Arg Ser Gly Gly Val Cys Asp 
50 55 60 

Asp Cys Gin His Asn Thr Glu Gly Gin Tyr Cys Gin Arg cys Lys Pro 
65 70 75 80 

Gly Phe Tyr Arg Asp Leu Arg Arg Pro Phe Ser Ala Pro Asp Ala Cys 
85 90 95 

Lys Pro cys Ser Cys His Pro Val Gly Ser Ala Val Leu Pro Ala Asn 
100 1Q5 110 
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Ser Val Thr Phe Cys Asp Pro Ser Asn Gly Asp Cys Pro Cys Lys Pro 
115 120 125 

Gly Val Ala Gly Arg Arg Cys Asp Arg Cys Met Val Gly Tyr Trp Gly 
!30 135 140 

Phe Gly Asp Tyr Sly Cys Arg Pro Cys Asp Cys Ala Gly ser Cvs Asp 
145 150 155 ' 160 

Pro He Thr Gly Asp Cys lie Ser Ser His Thr Asp He Asp Trp Tyr 
165 170 175 

His Glu Val Pro Asp Phe Arg Pro Val His Asn Lys Ser Glu Pro Ala 
180 185 190 

Trp Glu Trp Glu Asp Ala Sin Gly Phe Ser Ala Leu Leu His Ser Gly 
195 200 205 

Lys Cys Glu Cys Lys Glu Gin Thr Leu Gly Asn Ala Lys Ala Phe Cys 
210 215 220 

Gly Met Lys Tyr ser Tyr Val Leu Lys He Lys lie Leu Ser Ala His 
225 230 235 240 

Asp Lys Gly Thr His Val Glu Val Asn Val Lys He Lys Lys Val Leu 
245 250 255 

Lys Ser Thr Lys Leu Lys lie Phe Arg Gly Lys Ala Asn He He Ser 
260 265 270 

Arg He Met Asp Gly Gin Arg Met His Leu Ser Asn Pro Gin Ser Trp 
275 280 285 

Phe Gly lie Pro Cys Ser Arg Thr 
290 295 



<210> 2B7 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<400> 287 

Cys ASp Asp Cys Gin His Asn Thr Glu Gly Gin Tyr Cys Gin Arg Cys 
1 5 10 is 

Lys Pro Gly Phe Tyr Arg Asp Leu Arg Arg Pro Phe Ser Ala Pro Asp 
20 25 30 

Ala Cys Lys Pro Cys 
35 



<210> 288 
<211> 36 
<212> PRT 

<213> Homo sapiens 
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<4O0> 288 

Cys Pro Cys Lys Pro Gly Val Ala Gly Arc Arg Cys Asp Arg Cys Met 
1 5 1C 15 

Val Gly Tyr Trp Gly Phe Gly Asp Tyr Gly Cys Arg Pro Cys Asp Cys 
20 25 30 

Ala Gly Ser Cys 
35 



<21Q> 289 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<400> 289 

Asn lie Ser Ser Gin Tyr Cys He Leu Lys Ser Leu Glu Met Met lie 
15 10 : 5 

Ser Gly Leu Lys Leu Leu Val Leu Phe Leu Lys Phe Ala Pro Glu Asn 
20 25 30 

Tyr Cys Leu Ser Thr Glu Thr Leu Gin Met Pro Asn Arg His Leu Arg 
35 40 45 

Leu Scr Lys Ala Thr Cys Tyr Leu Met Lys Cys Leu Leu *>ro Ser Tyr 
50 55 60 

Phe Glu 

65 



<210> 290 
<211> 88 
<212> PRT 

<213> Hoino sapiens 
<400> 290 

Pro lie Glu Gly Thr Pro Ala Gly Thr Gly Pre Glu Phe Pro Gly Arg 
15 10 15 

Pro Thr Arg Pro Gin Arg Met Arg Ser Leu He Ser Ser His Pro Cys 
20 25 30 

Gin His Leu Leu Leu Leu Leu Leu Leu Leu Phe Leu He Leu Ala He 
35 40 45 

Leu Val Asp Val Lys Trp Tyr Leu Val Leu Phe He Cys He Se- Leu 
50 55 60 

Met Thr Ser Asp Val Glu His Leu Phe Met Cys Leu Leu Ala He Arg 
65 70 75 80 

He Ser Ser Trp Arg Asn Val Tyr 
35 



<210> 291 
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<2U> GO 
<212> PRT 

<213> Homo sapiens 
<400> 291 

Asn Trp Val Pro Thr Cys Leu Cys Pro Ser Ala Pro Cys Ser Phe Hia 
15 10 15 

Leu Leu Ser Arg Phe Lys Cys Leu Phe Ser Pro Gin Arg Leu Thr Asp 
20 25 30 

He Phe Arg Arg Tyr Asp Thr Asp Gin Asp Gly Trp Xle Gin Val Ser 
35 40 45 

Tyr Glu Gin Tyr Leu Ser Met Val Phe Ser He Val 
SO 55 60 



<210> 292 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 292 

Gin Arg Leu Thr Asp lie Phe Arg Arg Tyr Asp Thr Asp Gin Asp Gly 
15 10 15 

Trp rie Gin Val Ser Tyr Glu Gin Tyr Leu Ser Met Val Phe Ser lie 
20 25 30 



<210> 293 
<211> 73 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any ot the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<220> 

<221> SITE 
<222> (63) 

<223> Xaa equals any of the naturally occurring L-araino acids 
<400> 293 

Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Glu Leu Ser Val Ala Gly 
1 5 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arq Leu Pro Arg His 
20 25 30 

lie Cys Ser Gin Arg Xaa pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40 45 

Leu Gin Thr Thr Trp Xaa Xaa Pro Asp Lys Pro lie Pro Arg Leu Ser 
50 **ss 60 

Pro Pro Leu Xaa Ser Asp Pro Lys Arg 
65 70 



<210> 294 
<211> 95 
<212> PRT 

<213> Homo sapiens 
<400> 294 

Thr Ser Ser Pro Val Phe Ser Phe Cys Ser Met Ala Val Arg Glu Pro 
15 10 15 

Asp His Leu Gin Arg Val Ser Leu Pro Arg Tyr Asn Val Ser Ala Ser 

20 25 . . 30 

Leu Gin Trp Leu Pro Cys His Arg lie Val Leu Gin Pro Trp His Met 
35 40 45 

Cys Ala Met Trp Glu Leu Gly Gin Val Leu Phe His Pro Val Ala Pro 
50 55 60 

Arg Glu Gly Ala Ala Pro Ser Pro Val Ser Thr Leu Thr Trp Pro Ser 
65 70 75 80 

Ser Cys Ser His Ser Glu Ser Thr Met Glu Leu Glu Leu Gin Phe 
85 90 95 



<210> 295 
<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 295 

Met Ala Val Arg Glu Pro Asp His Leu Gin Arg Val Ser Leu Pro Arg 
15 10 15 
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<210> 296 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 296 

Leu Pro Cys His Arg lie Val 
1 5 



<210> 297 
<211> 15 
<212> PRT 

<213> Homo sapiens 

<400> 297 j- 

Ser Leu Gin Trp Leu Pro Cys His Arg He Val Leu Gin Pro Trp 
1 5 10 15 



<210> 296 
<211> 454 
<212> PRT 

<213> Homo sapiens 
<40O> 298 

Cys Phe Lys Arg Lys Pro Lys Arg Glu His Cys Ser cys Pro He "hr 
15 10 15 

Tyr Gin Ser Leu Gly Asp lie Leu Asn Ala Ser Phe Phe Ser Lys Arg 
20 25 30 

Lys Gly Met Gin Glu Val Lys Leu Asn Ser Tyr Val Val Ser Gly Thr 
35 40 45 

He Gly Leu Lys Glu Lys He Ser Leu Ser Glu Pro Val Phe Leu Thr 
50 55 60 

Phe Arg His Asn Gin Pro Gly Asp Lys Arg Thr Lys His He Cys Val 
65 70 75 80 

Tyr Trp Glu Gly Ser Glu Gly Gly Arg Trp Ser Thr Glu Gly Cys Ser 
85 90 95 

His Val His Ser Asn Gly Ser Tyr Thr Lys Cys Lys Cys Phe His Leu 
100 105 no 

Ser Ser Phe Ala Val Leu Val Ala Leu Ala Pro Lys Glu Asp Pro Val 
115 120 125 

Leu Thr Val He Thr Gin Val Gly Leu Thr He Ser Leu Leu Cys Leu 
130 135 i 4 o 

Phe Leu Ala He Leu Thr Phe Leu Leu Cys Arg Pro He Gin Asn Th>- 
145 150 155 160 

Ser Thr Ser Leu His Leu Glu Leu Ser Leu Cys Leu Phe Leu Ala His 
165 17C 175 
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Leu Leu Phe Leu Thr Gly lie Asn Arg Thr Glu Pro Glu Val Leu Cys 
180 185 190 

Ser lie lie Ala Gly Leu Leu His Phe Leu Tyr Leu Ala Cys Phe Thr 
195 200 205 

Trp Mst Lsu Leu Glu Gly Leu His Leu Phe Leu Thr Val Arg Asn Leu 
210 215 220 

Lys Val Ala Asn Tyr Thr Ser Thr Gly Arg Phe Lys Lys Arg Phe Met 
225 230 235 240 

Tyr Pro Val Gly Tyr Gly He Pro Ala Val He lie Ala Val Ser Ala 
245 250 255 

He val Gly Pro Gin As^ Tyr Gly Thx Phe Thr His Cys Trp Leu Lys 
260 265 270 

Leu Asp Lys Gly Phe Tie Trp Ser Phe Met Gly Pro Val Ala Val He 
275 2B0 285 

He Leu He Asn Leu Val Phe Tyr Phe Gin Val Leu Trp He Leu Arg 
290 295 300 

Ser Lys Leu Ser Ser Leu Asn Lys Glu Val Ser Thr He Glr. Asp Thr 
305 310 315 320 

Arg Val Met Thr Phe Lys Ala He Ser Gin Leu Phe He Leu Gly Cys 
325 330 335 

Ser Trp Gly Leu Gly Phe Phe Met Val Glu Glu Val Gly Lys Thr He 
340 345 350 

Gly Ser He lie Ala Tyr 5er Phe Thr He Tie Asn Thr Leu Gin Gly 
355 360 365 

Val Leu lcu Phe val Val His Cys Leu Leu Asn Arg Gin Val Arg Met 
370 375 380 

Glu Tyr Lys Lys Trp Phe Ser Gly Met Arg Lys Giy Val Glu Thr Glu 
385 390 395 400 

Ser Thr Glu Met Ser Arg Ser Thr Thr Gin Thr Lys Thr Glu Glu Val 
405 410 415 

Gly Lys Ser Ser Glu lie Phe His Lys Gly Gly Thr Ala Ser Ser Ser 
420 425 430 

Ala Glu Ser Thr Lys Gin Pro Gin Pro Gin Val His Leu Val Ser Ala 
435 440 445 

Ala Trp Leu Lys Met Asn 
450 



<210> 299 
<211> 101 
<212> PRT 
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<213> Homo sapiens 
<400> 299 

Phe Phe Trp Lys Glu Asn Leu Axg Arg Asn Gly Ser Arg Glu Asp Phe 
1 5 10 is 

Ala Arg Arg Ala Thx Gin Leu lie Gin Ser Val Glu Leu Ser He Tro 
20 25 30 

Asn Ala Ser Phe Ala Ser Pro Gly Lys Gly Gin lie Ser Glu Phe Asp 
35 40 45 

He val Tyr Glu Thr Lys Arg Cys Asn Glu Thr Arg Glu Asn Ala Phe 
50 55 60 

Leu Glu Ala Gly Asn Asn Thr Met Asp lie Asn Cys Ala Asp Ala Leu 
65 10 75 80 

Lys Gly Asn Leu Arg Glu Ser Thr Ala Val Ala Leu Ser Leu He Asn 
R5 go 95 

Leu Leu Gly He Phe 
100 



<210> 300 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<4O0> 300 

Asp He Asn Glu Cys Glu Thr Gly Leu Ala Lys cys Lys Tyr Lys Ala 
15 10 15 

Tyr Cys Arg Asn Lys Val Gly Gly Tyr He cys 
20 25 



<210> 301 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 301 

Cys Arg Asn Lys Val Gly Gly Tyr He Cys Ser Cys 
15 10 



<210> 302 
<211> 33L 
<212> PRT 

<213> Homo sapiens 
<400> 302 

Ala Leu Cys Pro His Pro His Leu He Leu Asn Val Thr Val Ser Pro 
1 5 10 15 

Ala Pro Ser Cys Arg His Val Lys Lys Val Val Ala Ser Pro Ser Pro 
20 25 30 
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Ser Thr Thr Met lie Ala Met Asp Ala Pro His Ser Lys Ala Ala Leu 
35 40 45 

Asp Ser He Asn Glu Leu Pro Glu Asn lie Leu Leu Glu Leu Phe Thr 
50 ss 60 

His Val :>ro Ala Arg Gin Leu Leu Leu Asn Cys Arg Leu Val Cys Ser 
65 70 75 BO 

Leu Trp Arg Asp Leu He Asp Leu Met Thr Leu Trp Lys Arg Lys Cys 
85 90 95 

Leu Arg Glu Gly Phe He Thr Lys Asp Trp Asp Gin Pro Val Ala Asp 
100 105 HO 

Trp Lys He Phe Tyr PhefLeu Arg Ser Leu His Arg Asn Leu Leu Arg 
1X 5 120 125 

Asn Pro Cys Ala Glu Glu Asp Met Phe Ala Trp Gin lie Asp Phe Asn 
130 135 140 

Gly Gly Asp Arg Trp Lys Val Glu Ser Leu Pro Gly Ala His Gly Thr 
145 ISO 155 160 

Asp Phe Pro Asp Pro Lys Val Lys Lys Tyr Phe Val Thr Ser Tyr Glu 
165 170 175 

Met Cys Leu Lys Ser Gin Leu Val Asp Leu Val Ala Glu Gly Tyr Trp 
180 185 190 

Glu Glu Leu Leu Asp Thr Phe Arg Pro Asp He Val Val Lys Asp Trp 
195 200 205 

Phe Ala Ala Arg Ala Asp Cys Gly Cys Thr Tyr Gin Leu Lys Val Gin 
210 215 220 

Leu Ala Ser Ala Asp Tyr Phe Val Leu Ala Ser Phe Glu Pro Pro Pro 
225 230 235 240 

Val Thr He Gin Gin Trp Asn Asn Ala Thr Trp Thr Glu Val Ser Tyr 
245 250 255 

Thr Phe Ser Asp Tyr Pro Arg Gly Val Arg Tyr He Leu Phe Gin His 
260 265 270 

Gly Gly Arg Asp Thr Gin Tyr Trp Ala Gly Trp Tyr Gly Pro Arg Val 
27 5 280 285 

Thr Asn Ser Ser He Val Val Ser Pro Lys Met Thr Arg Asn Gin Ala 
290 295 300 

Ser Ser Glu Ala Gin Pro Gly Gin Lys His Gly Gin Glu Glu Ala ALa 
305 310 315 320 

Gin Ser Pro Tyr Ary Ala Val Val Gin He Phe 
325 330 



\ 
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<210> 303 
<211> 328 
<212> PRT 
<213> Homo sapiens 

<400> 303 

Arg Gin Arg Ser Trp Asn Pro Gly Thr Asn Cys Tyr His Pro Asn Met 
15 10 15 

Pro Ak P Ala Phe Leu Thr Cys Glu Thr Val He Phe Ala Trp Ala He 
20 25 30 

Gly Gly Glu Gly Phe Ser Tyr Pro Pro His Val Gly Leu Ser Leu Gly 
35 40 45 

Thr Pro Leu Asp Pro His Tyr Val Leu Leu Glu Val His Tyr Asp Asn 
50 /" 55 60 

Pro Thr Tyr Glu Glu Gly Leu He Asp Asr. Ser Gly Leu Arg Leu Phe 
g 5 70 75 80 

Tyr Thr Met Asp He Arg Lys Tyr Asp Ala Gly Val He Glu Ala Gly 
85 90 95 

Leu Trp Val Ser Leu ?he His Thr He Pro Pro Gly Met Pro Glu Phe 
100 105 110 

Gin Ser Glu Gly His Cys Thr Leu Glu Cys Leu Glu Glu Ala Leu Glu 
115 120 125 

Ala Glu Lys Pro Ser Gly He His Val Phe Ala Val Leu Leu His Ala 
130 135 140 

His Leu Ala Gly Arg Gly He Arg Leu Arg His Phe Arg Lys Gly Lys 
1^5 ISO 155 160 

Glu Met Lys Leu Leu Ala Tyr Asp Asp Asp Phe Asp Phe Asn Phe Gin 
165 170 175 

Glu Phe Gin Tyr Leu Lys Glu Glu Gin Thr He Leu Pro Gly Asp Asn 
180 185 190 

Leu He Thr Glu Cys Arg Tyr Asn Thr Lys Aso Arg Ala Glu Met Thr 
195 200 205 

Trp Gly Gly Leu Ser Thr Arg Ser Glu Met Cys Leu Ser Tyr Leu Leu 
210 215 220 

Tyr Tyr Pro Arg He Asn Leu Thr Arg Cys Ala Ser tie Pro Asp He 
225 230 235 240 

Met Glu Gin Leu Gin Phe He Gly Val Lys Glu He Tyr Arg Pro Val 
245 250 255 

Thr Thr Trp Pro Phe He He Lys Ser Pro Lys Gin Tyr Lys Asn Leu 
260 255 270 

Ser Phe Met Asp Ala Met Asn Lys Phe Lys Trp Thr Lys Lys Glu Gly 
275 280 285 
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Leu Ser Phe Asn Lys Leu Val Leu Ser Leu Pro Val Asn Val Arg Cys 
290 295 300 

Ser Lys Thr Asp Asn Ala Glu Trp Ser He Pro Arg Asn Asp Ser He 
305 310 315 320 

Thr Ser Arc Tyr Arg Lys Thr Leu 
325 



<210> 304 
<211> 272 
<212> PRT 

<2L3> Homo sapiens 
<400> 304 / 

Ket Cys Cys Trp Pro Leu Leu Leu Leu Trp Gly Leu Leu Pro Gly Thr 
15 io 15 

Ala Ala Gly Gly Ser Gly Arg Thr Tyr Pro His Arg Thr Leu Leu Asp 
2a 25 30 

Ser Glu Gly Lys Tyr Trp Leu Gly Trp Ser Gin Arg Gly Ser Gin He 
35 40 45 

Ala phe Arg Leu Gin val Arg Thr Ala Gly Tyr Val Gly Phe Gly Phe 
50 55 60 

Ser Pro Thr Gly Ala Met Ala Ser Ala Asp He Val Val Gly Gly Val 
6 5 70 75 80 

Ala His Gly Arg Pro Tyr Leu Gin Asp Tyr Phe Thr Asn Ala Asn Arg 
85 90 95 

Glu Leu Lys Lys Asp Ala Gin Gin Asp Tyr His Leu Clu Tyr Ala Met 
100 105 110 

Glu Asn Ser Thr His Thr lie He Glu Phe Thr Arg Glu Leu His Thr 
115 120 125 

Cys Asp He Asn Asp Lys Ser He Thr Asp ser Thr Val Arg Val He 
130 135 140 

Trp Ala Tyr His His Glu Asp Ala Gly Glu Ala Gly Pro Lys Tyr His 
145 150 155 160 

Asp Ser Asn Arg Gly Thr Lys Ser Leu Arg Leu Leu Asn Pro Glu Lys 
165 170 175 

Thr Ser Val Leu Ser Thr' Ala Leu Pro Tyt Phe Asp Leu Val Asn Gin 
180 185 190 

Asp Val ?ro He Pro Asn Lys Asp Thr Thr Tyr Trp Cys Gin Met Phe 
195 200 205 

Lys He Pro Val Phe Gin Glu Lys His His Val He Lys Val Glu Pro 
2ir> 215 220 
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Val lie Gin Arg Gly His Glu Ser Leu Val His His lie Leu Leu Tyr 

225 230 i 235 240 

Gin cys Ser Asn Asn Phe Asn Asp Ser Val Pro Gly lie Arg Ala Arg 

245 250 255 

lie Ala He Thr Pro Thr Cys Pro Met His Ser Ser Pro Val Lys Leu 

260 265 270 



<210> 305 
<211> 207 
<212> PRT 

<213> Homo sapiens f 
<400> 305 

Thr Gly Thr Phe Trp Ser Pro Arg Sfir Gin Arg Arg Gly Cys Cys Gly 
15 10 15 

Arg Arg Ala Pro Arg Pro Glu Ala Met Glu Asn Gly Ala Val Tyr Ser 
20 25 30 

Pro Thr Thr Glu Glu. Asp Pro Gly Pro Ala Arg Gly Pro Arg Ser Gly 
35 40 45 

Leu Ala Ala Tyr Phe Phe Met Gly Arg Leu Pro Leu Leu Arg Arg Val 
50 55 60 

Leu Lys Gly Leu Gin Leu Leu Leu Ser Leu Leu Ala Phe lie cys Glu 
65 70 75 80 

Glu Val Val Ser Gin Cys Thr Leu Cys Gly Gly Leu Tyr Phe Phe Glu 
85 90 95 

Phe Val Ser Cys Ser Ala Phe Leu Leu Ser Leu Leu He Leu lie Val 
100 105 110 

Tyr Cys Thr Pro Phe Tyr Glu Arg Val Asp Thr Thr Lys Val Lys Ser 
115 120 125 

Ser Asp Phe Tyr He Thr Leu Gly Thr Gly Cys Val Phe Leu Leu Ala 
130 135 140 

Ser He He Phe Val Ser Thr His Asp Arg Thr Ser Ala Glu He Ala 
145 ISO 155 160 

Ala He Val Phe Gly Phe He Ala Ser Phe Met Phe Leu Leu Asp Phe 
165 170 175 

He Thr Met Leu Tyr Glu Lys Arg Gin Glu Ser Gin Leu Arg Lys Pro 
180 185 190 

Glu Asn Thr Thr Arg Ala Glu Ala Leu Thr Glu Pro Leu Asn Ala 
195 200 205 
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<210> 306 
<211> 135 
<212> PRT 
<213> Homo sapiens 

<400> 306 

Ala Eer Ala Pro Arg Val Met Arc Gly His Leu Ala Gly Phe Pro Ala 
IS 1Q is 

Leu Ser Gly Leu Ala Ser Val Cys Leu Trp Ala Thr Phe Ser Ala Gin 
20 25 30 

Leu Pro Gly Pro Val Ala Ala Thr Ser Trp Thr Pro Ala Pro Leu Gly 
35 40 45 

Cys Ser Ala Ala Arg Ser Gly Pro Glu Lys Arg Leu Gly Thr Ala Ala 
50 ^55 60 

Pro Gly Ser Ala Ala Ser Leu Ala Gin Ala Gly Pro Gly Ala Pro Cys 
65 70 75 80 

Arg Val Leu Pro Val Asp Pro Ala Pro Ala Ala Leu Asn Val Arg Glu 
85 90 95 

Pro Gly Trp Leu Gly Gly Leu Phe Asp Gly Ala Leu Leu Gin Val Leu 
100 105 no 

Leu Asn Phe Leu Arg Lys Ser Thr Asp Val Leu Met Asp Thr Arg Glu 
115 120 125 

Ala Glu Ser Leu Glu Val Glu 
130 135 



<210> 307 
<211> 188 
<212> PRT 

<213> Homo sapiens 
<40O> 307 

Asn Lys Leu His Ser Phe Pro Val Phe Leu Ser Gin Leu Leu :,eu Asp 
1 5 10 15 

Arg Gin Leu Leu His Ala Pro Gin Thr Leu Pro Thr Pro His Cys Gly 
20 25 30 

Gly Ser Ser Arg Pro Gly Pro Ser His Pro Pro Trp Leu Leu lie Gin 
35 40 45 

Leu Pro Cys Val His Val Ala Leu Trp Gin Met Leu Arg Asp Phe Ser 
50 55 €0 

Asp Ser Arg lie Thr Pro Ser Thr Leu Thr Thr Gin Pro Ala Ala Gin 
65 70 75 fio 

Thr Ala Ala Pro Ala Lys Asp Gin Glu Ser Asp lie Val Gly Gly Glu 
85 90 95 

Gly lie Leu Cys Asp He Ala Phe Leu Gin Glu Asp His Pro Leu Gly 
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100 105 110 

Val Gly Gly Ala Ser Ala Pro Ser Ser Arg Arg Glu Leu Ser Arg Arg 
115 120 125 

Gly Val Kis Thr Gin Thr Leu Pro Glu Asp Gly Thr Leu His Gly Thr 
130 135 140 

Pro Ser Ser Ser Phe Asp Cys Gly lie Lys Tyr lie lie Ser Trp Pro 
145 ISO 155 160 

Leu Ala Pro Gly Cys Asp Leu Pro Ser Leu Glu Leu Ser Leu. Val Cys 
165 170 175 

Lys Gly Vai Ser Ser Cys Met Gly Phe Ala Ala Gly 
180 185 



<210> 308 

<211> 7fl 

<212> PRT 

<213> Homo sapiens 

<400> 308 

Pro Gly Arg Pro Thr Arg Pro Thr Lys Asn Lys Val Cys Val Cys Leu 
15 10 15 

Gly MeL Leu Phe Trp Ala Tyr Pro He Cys Val Phe He Asp Ser Leu 
20 25 30 

Ser Cys Gin Pro Cys Leu Trp Ser Thr Gly Ala Thr Ser His Phe Asn 
35 40 45 

Ser Pro Thr Thr Ser Pro Leu Phe Thr Leu Phe Met Pro Cys Ala Leu 
50 55 60 

Ala Pro Asn Pro Phe Thr Gin Leu Gly Lys Leu Asp Asp Arg 
65 70 75 



<210> 309 

<2H> 10 

<212> PRT 

<213> Homo sapiens 



<400> 309 

Pro Val Asp Leu Thr Lys Thr Arg Leu Gin 
1 5 in 



<210> 310 

<c2ll> 10 

<212> PRT 

<213> Homo sapiens 

<400> 310 

Pro Thr Asp Val Leu Lys He Arg Met Gin 
* 5 10 
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<210> 311 
<211> 313 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (117> 

<223> Xaa equals any of the naturally occurring I, -amino acids 
<400> 311 

Met Thr Phe Gly Ser Thr He Ser Pro Thr Ser Thr His Ala Ser Pro 
1 5 .10 15 

Ser Leu Gly Phe Cys Cys Ser Trp Leu Leu Glu Asp Leu Glu Glu Gin 
20 * 25 30 

Leu Tyr Cys Ser Ala Phe Glu Glu Ala Ala Leu Thr Arg Arg lie Cys 
35 40 45 

Asn Pro Thr Ser Cys Trp Leu Pro Leu Asp Met Glu Leu Leu His Atq 
50 55 60 

Gin Val Leu Ala Leu Gin Thr Gin Arg Val Leu Leu Gly Mot Trp Leu 
65 70 75 ao 

Arg Arg Ala Trp Asp Thr Trp Val Ser Pro Arg Arg Val Ala Pro Gly 
85 90 95 

Ser Arg Cys Leu Leu Thr Ala Ser His Pro Cys Thr Glu Lys Arg Arg 
100 105 110 

Lys Ala Ser Ala Xaa Gin Arg Asn Leu Gly Tyr Pro Leu Ala Met Leu 
115 120 125 

Cys Leu Leu Val Leu Thr Gly Leu Ser Val Leu He Val Ala lie His 
130 135 140 

lie Leu Glu Leu Leu He Asp Glu Ala Ala Met Pro Arg Gly Ket Gin 
145 150 155 160 

Gly Thr Ser Leu Gly Gin Val Ser Phe Ser Lys Leu Gly Ser Phe Gly 
165 170 175 

Ala Val He Cln Val Val Leu He Phe Tyr Leu Met Val Ser Ser Val 
180 185 190 

Val Gly Phe Tyr Ser Ser Pro Leu Phe Arg Ser Leu Arg Pro Arg Trp 
195 200 205 

His Asp Thr Ala Met Thr Gin He He Gly Asn Cys Val Cys Leu Leu 
210 215 220 

Val Leu Ser Ser Ala Leu Pro Val Phe Ser Arg Thr Leu Gly Leu Thr 
225 230 235 240 

Arg Phe Asp Leu Leu Gly Asp Phe Gly Arg Phe Asn Trp Leu Gly Asn 
245 250 255 
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Phe Tyr He Val Phe Leu Tyr Asn Ala Ala Phe Ala Gly Leu Thr *Thr 
260 265 270 

Leu Cys Leu Val Lys Thr Phe Thr Ala Ala Val Arg Ala Glu Leu He 
275 280 285 

Arg Ala Phe Gly Leu Asp Arg Leu Pro Leu Pro Val Ser Gly Phe Pro 
290 295 300 

Gin Ala Ser Arg Lys Thr Gin His Gin 
305 310 



<210> 312 

<211> 92 

<212> PRT f 

<213> Homo sapiens 

<400> 312 

Leu Cys Val Cys Leu Val Tyr Leu Cys Met Tyr Gly Val Cys Leu Cys 
1 5 10 15 

Val lie Val Cys Val Ser Gly Val Ser Leu Cys Leu Tyr Val Trp Gly 
20 25 30 

Val Scr Val Cys Asp Cys Val Ser Val Phe Met Cys Val Cys Leu Cys 
35 40 45 

Val He Phe Cys Val Tyr Gly Lys Pro Arg Thr Glu His Tyr His Ser 
50 55 60 

Pro His Leu Ala Lys Gin Lys Ala Phe Arg Glu Met Cys Gly Arg His 
65 70 75 80 

Asp Val Ser Ala Ala Gly He Phe Gin Ser Tyr Val 
85 90 



<21C> 313 
<211> 207 
<212> PRT 

<213> Homo sapiens 
<400> 313 

Gly His Met Pro Tyr Gly Trp Leu Thr Glu He Arg Ala Val Tyr Pro 
1 5 10 15 

Ala Phe Asp Lys Asn Asn Pro Ser Asn Lys Lea Val Ser Thr Ser Asn 
20 25 30 

Thr Val Thr Ala Ala His He Lys Lys Phe Thr Phe Val Cvs Met Ala 
35 40 45 

Leu Ser Leu Thr Leu Cys Phe Val Met Phe Trp Thr Pro Asn Val Ser 
50 55 60 

Glu Lys He Leu Tie Asp He He Gly Val Asp Phe Ala Phe Ala Glu 
65 70 75 80 
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Leu Cys Val Val Pro Leu Arg lie Phe Ser Phe Phe Pro Val Pro Val 
85 90 95 

Thr Val Arg Ala His Leu Thr Gly. Trp Leu Met Thr Leu Lys Lys Thr 
100 105 110 

Phe Val Leu Ala Pro Ser Ser val Leu Arg lie lie Val Leu lie Ala 
115 120 125 



Ser Leu Val Val Leu Pro Tyr Leu 
130 135 

Val Gly Ser Leu Leu Ala Gly Phe 
145 150 

lie Ala Ala Cys Tyr- Val Tyr Arg 
165 

Glu Ser Ala Thr Glu Gly Glu Asp 
180 

Thr Glu Glu Val Thr Asp He Val 
195 200 



Gly Val His Gly Ala Thr Leu Gly 
140 

Val. Gly Glu Ser Thr Met Val Ala 
155 160 

Lys Gin Lys Lys Lys Met Glu Ash 
170 175 

Ser Ala Met Thr Asp Met Pro Pro 
185 190 

Glu Met Arg Glu Glu Asn Glu 
205 



<210> 314 
<211> 114 
<212> PRT 

<213> Homo sapiens 



<400> 314 

Gin Val Val Phe Val Ala He Leu 
1 5 

Glu Pro Leu Leu He Pro He Leu 
20 



Leu His Ser His Leu Glu Cys Arg 
10 15 

Ser Leu Tyr Met Gly Ala Leu Val 
25 30 



Arg Cys Thr Thr Leu Cys Leu Gly Tyr Tyr Lys Asn He His Asp He 

35 40 45 

He Pro Asp Arg Ser Gly Pro Glu Leu Gly Gly Asp Ala Thr lie Arg 

50 55 60 



Lys Met Leu Ser Phe Trp Trp Pro Leu Ala Leu He Leu Ala Thr Gin 
65 70 75 80 

Arg He Ser Arg Pro He Val Asn Leu Phe Val Ser Arg Asp Leu Gly 
85 90 95 

Gly Ser Ser Ala Ala Thr Glu Ala Val Ala He Leu Thr Ala Thr Tyr 
100 105 110 

Pro Val 



<210> 315 
<211> 115 



WO 00/06698 



209 



PCT/US99/17I30 



<212> PRT 

<213> Homo sapiens 

<400> 315 

Arg Cys Cys Cys Arg Gly Cys Ser Cys Arg Ala Arg Leu Cys Pro Pro 
15 10 is 

Ala Arg ser Thr Ala Val Ala Pro Glu Cys Arg Gly Ala His Pro Ser 
20 25 30 

Arg Ala Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val 
35 40 45 

Leu Leu Val Gly Leu Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin 
50 55 60 

Pro Val Cys Arg Gly Gly.^Thr Gin Arg Pro Cys Tyr Lys Val lie Tyr 
65 70* 75 80 

?he His Asp Thr Ser Arg Arg r.en Asn Phe Glu Glu Ala Lys Glu Ala 
85 90 95 

Cys Arg Arg Gly Trp Arg Pro Ala Ser Gin His Arg Val Leu Lys Met 
100 105 110 

Asn Arg Asn 
115 



<210> 316 

<211> 81 

<212> PRT 

<213> Homo sapiens 

<400> 316 

Met Arg Pro Gly Thr Ala Leu Gin Ala Val Leu Leu Ala Val Leu Leu 
15 10 15 

Val Gly Leu Arg Ala Ala Thr Gly Arg Leu Leu Ser Gly Gin Pro Val 
20 25 30 

Cys Arg Gly Gly Thr Gin Arg Pro Cys Tyr Lys Val lie Tyr Phe His 
35 40 45 

Asp Thr Ser Arg Arg Leu Asn Phe Clu Glu Ala Lys Glu Ala Cys Arg 
50 55 60 

Arg Gly Trp Arg Pro Ala Ser Gin Kis Arg Val Leu Lys Met Asn Arg 
65 70 75 80 

Asn 



<210> 317 
<211> 290 
<212> PRT 
<213> Homo sapiens 
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<400> 317 

He Arg His Glu Gin Gin Gly Glu Glu Asp Asp Glu His Ala Arg Pro 
15 10 15 

Leu Ala Glu Ser Leu Leu Leu Ala He Ala Asp Leu Leu Phe Cys Pro 
20 25 30 

Asp Phe Thr Val Gin Ser; His Arg Arg Ser Thr Val Asp Ser Ala Glu 
35 40 45 

Asp Val His Ser Leu Asp Ser Cys Glu Tyr He Trp Glu Ala Gly Val 
50 55 60 

Gly Phe Ala His Ser Pro Gin Pro Asn Tyr He His Asp Met Asn Ara 
65 70 75 80 

Met Glu Leu Leu Lys Le£ Leu Leu Thr Cys Phe Ser Glu Ala Met Tyr 
85 90 95 

Leu Pro Pro Ala Pro Glu Ser Gly Ser Thr Asn Pro Trp Val Gin Phe 
LOO 105 no 

Phe Cys Ser Thr Glu Asn Arg His Ala Leu Pro Leu Phe Thr Ser Leu 
115 120 125 

Leu Asn Thr Val Cys Ala Tyr Asp Pro Val Glv Tyr Gly He Pro Tyr 
130 135 14Q 

Asn His Leu Leu Phe Ser Asp Tyr Arg Glu Pro Leu Val Glu Glu Ala 
145 150 155 leo 

Ala Gin Val Leu He Val Thr Leu Asp His Asp Ser Ala Ser Ser Ala 
165 170 175 

Ser Pro Thr Val Asp Gly Thr Thr Thr Gly Thr Ala Met Asp Asp Ala 
180 185 190 

Asp Pro Pro Gly pro Glu Asn Leu Phe Val Asn Tyr Leu Ser Arg He 
195 200 205 

His Arg Qlu Glu Asp Phe Gin Phe He Leu Lys Gly He Ala Arg Leu 
210 215 220 

Leu Ser Asr. Pro Leu Leu Gin Thr Tyr Leu Pro Asn Ser Thr Lys Lys 
225 230 235 240 - 

Asp Pro val Pro Pro Gly Ala Ala Ser Ser Leu Leu Glu Ala Leu Arg 
245 250 255 

Leu Gin Gin Glu He Pro Leu Leu Arg Ala Glu Glu Gin Arg Arg Pro 
260 265 270 

Arg His Pro Cys Pro His Pro Leu Leu Pro Gin Arg Cys Pro Glv A-a 
275 280 285 

Ser Val 
290 
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<210> 318 
<211> 318 
<212> prt 
<213> Homo sapiens 

<400> 318 

Arg Lou Val Tyr Asn Lys Thr Ser Arg Ala Thr Gin Phe Pro Asp Gly 
1 5 l0 15 

Val Asp Val Arg Val Pro Gly Phe Gly Lys Thr Phe Ser Leu Glu Ph*> 
20 25 30 

Leu Asp Pro Ser Lys ser Ser Val Gly Ser Tyr Phe His Thr Met Val 
35 40 45 

Glu Ser Leu Val Gly Trp Gly Tyr Thr Arg Gly Glu Asp Val Arg Gly 
50 .55 60 

Ala Pro Tyr Asp Trp Arg Arg Ala Pro Asn Glu Asn Gly Pro Tyr Phe 
65 7n 75 80 



Leu Al* Leu Arg Glu Met lie Glu Glu Ket Tyr Gin Leu Tyr Gly Gly 
85 go 95 

Pro Val Val Leu Val Ala His Ser Met Gly Asn Met Tyr Thr Leu Tvr 
100 105 110 Y 

Phe Leu Gin Arg Gin Pro Gin Ala Trp Lys Asp Lys Tyr He Arg Ala 
115 120 125 

Phe val Ser Leu Gly Ala Pro Trp Gly Gly Val Ala Lys Thr Leu Arg 
130 135 140 3 

Val Leu Ala Ser Gly Asp Asn Asn Arg He Pro Val He Gly Pro Leu 
Lys He Arg Glu Gin Gin Arg Ser Ala Val Ser Thr Ser Trp Leu Leu 



165 



1'0 175 



Pro Tyr Asn Tyr Thr Trp Ser Pro Glu Lys Val Phe Val Gin Thr Pro 
180 IBS 190 

Thr He Asn Tyr Thr Leu Arg Asp Tyr Arg Lys Phe Phe Gin Asp He 
195 200 205 

Gly Phe Glu Asp Gly Trp Leu Met Arg Gin Asp Thr Glu Gly Leu Val 
210 215 220 

Glu Ala Thr Met Pro Pro Gly Val Gin Leu His Cys Leu Tyr Gly Thr 
225 230 235 Y 240 

Gly Val Pro Thr Pro Asp Ser Phe Tyr Tyr Glu Ser Phe Pro Asp Arg 
245 250 255 

Asp Pro Lys He Cys Phe Gly Asp Gly Asp Gly Thr Val Asn Leu Lys 
260 265 270 

Ser Ala Leu Gin Cys Gin Ala Trp Gin Ser Arg Gin Glu His Gin Val 
275 280 285 
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Leu Leu Gin Glu Leu Pro Gly Ser Glu His He Glu Met Leu Ala Asn 
290 295 300 

Ala Thr Thr Leu Ala Tyr Leu Lys Arg Val Leu Leu Gly Pro 
305 310 315 



<210> 319 

<211> 362 

<212> PRT 

<213> Homo sapiens 

<400> 319 

Met Asn Lys Glu Asp Lys Val Trp Asn Asp Cys Lys Gly Val Asn Lys 
1 5 10 15 

-? 

Leu Thr Asn Leu Glu Glu Gin Tyr lie Tie Leu lie Phe Gin Asn Gly 
20 25 30 

Leu Asp Pro Pro Ala Asn Met Val Phe Glu Ser He lie Asn Glu He 
35 40 45 

Gly He Lys Asn Asn He Ser Asn Phe Phe Ala Lys He Pro Phe Glu 
50 55 so 

Glu Ala Asn Gly Arg Leu Vctl Ala Cys Thr Arg Thr Tyr Glu Glu Ser 
65 70 75 80 

He Lys Gly Ser Cys Gly Gin Lys Glu Asn Lys He Lys Thr Val Ser 
85 go 95 

Phe Glu Ser Lys He Gin Leu Arg Ser Lys Gin Glu Phe Gin Phe Phe 
100 105 no 

Asp Glu Glu Glu Glu Thr Gly Glu Asn His Thr He Phe He Gly Pro 
H5 120 125 

Val C-lu Lys Leu He Val Tyr Pro Pro Pro Pro Ala Lys Gly Gly He 
130 135 140 

Ser Val Thr Asn Glu Asp Leu His Cys Leu Asn Glu Gly Glu Phe Leu 
145 150 155 160 

Asn Asp Val He He Asp Phe Tyr Leu Lys Tyr Leu Val Leu Glu Lys 
165 170 175 

Leu Lys Lys Glu Asp Ala Asp Arg He His He Phe Ser Ser Phe Phe 
180 185 190 

Tyr Lys Arg Leu Asn Gin Arg Glu Arg Arg Asn His Glu Thr Thr Asn 
195 200 205 

Leu Ser He Gin Gin Lys Arg His Gly Arg Val Lys Thr Trp Thr Arg 
210 215 220 

His Val Asp He Phe Glu Lys Asp Phe He Phe Val Pro Leu Asn Glu 
225 230 235 240 
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Ala Ala His Trp Phe Leu Ala Val Val Cys Phe Pro Gly Leu Glu Lys 
245 , 25C 255 • 

Pro Lys Tyr Glu Pro Asn Pro His Tyr His Glu Asn Ala Val lie Gin 
260 265 270 

Lys Cys Ser Thr Val Glu Asp Ser Cys He Ser Ser Ser Ala Ser Glu 
275 280 285 

Met Glu Ser Cys Ser Gin Asn Ser Ser Ala Lys Pro Val He Lys lys 
290 295 * 300 

Met Leu Asn Lys Lys His Cys He Ala Val He Asp Ser Asn Pro Gly 
305 310 315 320 

Gin Glu Glu Ser Asp Pro Arg Tyr Lys Arg Asn He Cys Ser Val Lys 
325 f 330 335 

Tyr Ser val Lys lys He Asn His Thr Ala Ser Glu Asn Glu Glu Phe 
340 345 350 

Asn Lys Gly Glu Ser Thr Scr Gin Lys Ser 
355 360 



<210> 320 
<211> 330 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (38) 

<223> xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (247) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 320 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 5 10 15 

Gly Ala Ala Val He Leu Ala Gin Leu Leu Arg Arg Cys Arg Gly Gly 
20 25 30 

Phe Leu Glu Pro Val Xaa Pro Pro Arg Pro Asp Arg Val Ala He Val 
35 40 45 

Thr Gly Gly Thr Asp Gly He Gly "Tyr Ser Thr Ala Asn He Trp Arg 
50 55 60 

Asp Leu Gly Met His Val He He Ala Gly Asn Asn Asp Ser Lys Ala 
65 70 75 80 

Lys Gin Val Val Ser Lys Xle Lys Glu Glu Thr Leu Asn Asp Lys Val 
85 9G 95 
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Glu Phe Leu Tyr Cys Asp Leu Ala Ser Mer Thr Ser He Arg Gin Phe 
100 105 no 

Val Gin Lys Phe Lys Met Lys Lys lie Pro Leu His Val Leu lie Asr. 
115 120 125 

Asn Ala Gly Val Met Met Val Pro Gin Arg Lys Thr Arg Asp Gly phe 
130 135 140 

Glu Glu His Phe Gly Leu Asa Tyr Leu Gly His Phe Leu Leu Thr Asn 
145 150 155 160 

Leu Leu Leu Asp Thr Leu Lys Glu Ser Gly Ser Pro Gly His Ser Ala 
iSS . 170 175 

Arg Val Val Thr Val Ser Ser Ala Thr His Tyr Val Ala Glu Leu Asn 
180 # 185 190 

Met Asp Asp Leu Gin Ser Ser Ala Cys Tyr Ser Pro His Ala Ala Tyr 
195 200 205 

Ala Gin Ser Lys Leu Ala Leu Val Leu Phe Thr Tyr His Leu Gin Arg 
210 215 220 

Leu Leu Ala Ala Glu Gly Ser His Val Thr Ala Asn Val Val Asp Pro 
225 230 235 240 

Gly Val Val Asn Thr Asp Xaa Tyr Lys His Val Phe Trp Ala Thr Arg 
245 250 255 

Leu Ala Lys Lys Leu Leu Gly Trp Leu Leu Phe Lys Thr Pro Asp Glu 
260 265 270 

Gly Ala Trp Thr Ser lie Tyr Ala Ala Val Thr Pro Glu Leu Glu Gly 
275 280 285 

Val Gly Gly Arg Tyr Leu Tyr Asn Glu i_ys Glu Thr Lys Ser Leu His 
220 295 300 

Val Thr Tyr Asn Gin Lys Leu Gin Gin Gin Leu Trp Ser Lys Ser Cys 
3 °5 310 315 320 

Glu Met Thr Gly Val Leu Asp Val Thr Leu 
. 325 330 



<210* 321 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221;> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amir.o acids 
<400> 321 

Met Ser Pro Leu Ser Ala Ala Arg Ala Ala Leu Arg Val Tyr Ala Val 
1 5 10 15 
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Gly Ala Ala Val He Leu Ala Glo 
20 

Phe Leu Glu Pro Val Xaa Pro Pro 
35 40 

Thr Gly Gly Thr Asia Gly He Gly 
50 55 

Asp Leu Ala Cys Met Leu Ser 
65 70 



Leu Leu Arg Arg Cys Arg Gly Gly 

25 30 

Arg Pro Asp Arg Val Ala He Val 
45 

Tyr Ser Thr Ala Asn He Trp Arg 
60 



<210> 322 
<211> 266 

<212> PRT . 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (97) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (174) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (195) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (199) 

<223> xaa equals any oE the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (206) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 322 

Met Glu Val Thr Thr Glu Asp Thr Ser Arg Thr Asp Val Ser Glu Pro 
1 5 io 



15 



Ala Thr Ser Gly Gly Ala Ala Asp Gly Val Thr Sex He Ala Pro Thr 
20 25 30 

Ala Val Ala Ser Ser Thr Thr Ala Ala Ser He Thr Thr Ala Ala Ser 
35 40 45 

Ser Met Thr Val Ala Ser Ser Ala Pro Thr Thr Ala Ala Ser Ser Thr 
50 55 60 

Thr Val Ala Ser lie Ala Pro Thr Thr Thr Ala Ser Ser Met Thr Ala 
65 70 75 ro 
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Ala Ser Ser Thr Pro Met Thr Leu Ala Leu Pro Ala Pro Thr Ser Thr 
85 90 95 

Xaa Thr Gly Arg Thr Pro Ser Thr Thr Ala Thr Gly His Pro Ser Leu 
100 105 110 

Ser Thr Ala Leu Ala Gin Val Pro Lys ser Ser Ala Leu Pro Arg Thr 
115 120 125 

Ala Thr Leu Ala Thr Leu Ala Thr Arg Ala Gin Thr Val Ala Thr Thr 
130 135 140 

Ala Asn Thr Ser Ser Pro Met Ser Thr Arg Pro Ser Pro Ser Lys His 
, 145 150 155 160 

Met Pro Ser Asp Thr Ala A£a Ser Pro Val Pro Pro Met Xaa Pro Gin 
165 170 175 

Ala Gin Gly Pro He Ser Gin Val Ser Val Asp Gin Pro Val V*l Asn 
180 185 190 

Thr Thr Xaa Lys Ser Thr Xaa Met Pro Ser Asn Thr Thr Xaa Glu Pro 
195 200 205 

Leu Thr Gin Ala Val Val Asp Lys Thr Leu Leu Leu Val Val Leu Leu 
210 215 220 

Leu Gly Val Thr Leu Phe lie Thr Val Leu Val Leu Phe Ala Leu Gin 
225 230 235 240 

Ala Tyr Glu Ser Tyr Lys Lys Lys Asp Tyr Thr Gin Val Asp Tyr Leu 
245 250 255 

He Asn Gly Met Tyr Ala Asp Ser Glu Met 
260 265 



<210> 323 
<211> 99 
<212> PRT 

<2 13 > Homo sapiens 
<400> 323 

Ala Arg Cys Pro Glu Leu Pro Gly Leu Arg Cys Arg Pro Arg Pro Arg 
15 10 15 

Ala Gly Pro Gin Ala Pro Ser Tyr Cys Pro Arg Ala Thr Arg Pro Pro 
20 25 30 

Gly Ala Cys Cys Ala Arg Ket Arg Leu Leu Leu Glu Trp Arg Val Tyr 
35 40 45 

Leu Arg Leu Thr Cys Ala Thr Lys Asp Gly Met Ala Arq Glu Cys Pro 
50 55 60 

Thr Thr Trp Leu Ser Pro Pro Ala Lys Pro Asp Phe Ala Gin Arg His 
65 *?0 75 80 
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Ser Val Lys Pro Thr Ala Leu Gin Gly Gly Arg Trp Ser Arg Leu Gly 
85 90 95 ■ 

Ala Ser Pro 



<210> 324 

<211> 96 

<212> PRT 

<213> Hono sapiens 

<40O> 324 

Leu Pro Ala Thr Val Glu Phe Ala Val His Thr Phe Asn Gin Gin Ser 
1 5 10 15 

Lys Asp Tyr Tyr Ala ^Tyr Arg Leu Gly His He Leu Asn Ser Trp Lys 
20 25 30 

Glu Gin Val Glu Ser Lys Thr Val Phe Ser Met Glu Leu Leu Leu Gly 
35 40 45 

Arg Thr Arg Cys Gly Lys Phe Glu Asp Asp He Asp Asn Cys His Phe 
50 55 60 

Gin Glu Ser Thr Glu Leu Asn Asn Thr Phe Thr Cys Phe Phe Thr He 
65 70 75 80 

Ser Thr Arg Pro Trp Met Thr Gin Phe Ser Leu Leu Asn Lys Thr Cys 
85 90 95 



<210> 325 

<211> 166 

<212> PRT 

<213> Homo sapiens 

<400> 325 

Leu Leu Trp Ala Arg Gly Leu Gly Arg Ala Lys Ser Ala Val Pro Thr 
1 5 io is 

Val Ser Thr Met Leu Gly Leu Pro Trp Lys Gly Gly Leu Ser Trp A La 
20 25 30 

Leu Leu Leu Leu Leu Leu Gly Ser Gin He Leu Leu He Tyr Ala Trp 
35 40 45 

His Phe His Glu Gin Arg Asp Cys Asp Glu His Asn Vol Ket Ala Arq 
50 55 60 

Tyr Leu Pro Ala Thr Val Glu Phe Ala Val His Thr Phe Asn Gin Gin 
65 70 75 80 

Ser Lys Asp Tyr Tyr Ala Tyr Arg Leu Gly His He Leu Asn Ser Trp 
85 90 95 
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Lys Glu Gin Val Glu Ser Lys Thr Val Phe Ser Met CI u Leu Leu Leu 
100 105 no 

Gly Arg Thr Arg Cys Gly Lys Phe Glu Asp Asp lie Asp Asn Cys His 
115 120 125 

Phe Gin Glu Ser Thr Glu Leu Asn Asn Thr Phe Thr Cys Phe Phe Thr 
"0 135 140 

lie Ser Thr Arg Pro Trp Met Thr Gin Phe Ser Leu Leu Asn Lys Thr 
145 150 155 ifio 

Cys Leu Glu Gly Phe His 
165 



<210> 326 ^ 
<211> 214 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (200) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (205) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 326 

Leu Glu Gin Lys Leu Glu Leu His Arg Gly Gly Gly Arg Ser Arg Thr 
1 5 io is 

Ser Gly Ser Pro Gly Leu Cln Glu Phe Gly Thr Arg Glu Glu Arg Gly 
20 25 30 

Glu Gly Glu Gin Arg Thr Gly Arg Glu Phe Ser Gly Asn Gly Gly Arg 
35 40 45 

Ala Val Glu Ala Ala Arg Met Arg Leu Leu Cys Gly Leu Trp Leu Trp 
5 <J 55 60 

Leu Ser Leu Leu Lys Val Leu Gin Ala Gin Thr Pro Thr Pro Leu Pro 
65 70 75 80 

Leu Pro Pro Pro Ket Gin Ser Phe Gin Gly Asn Gin Phe Gin Gly Glu 
85 90 95 

Trp Phe Val Leu Gly Leu Ala Gly Asn Ser Phe Arg Pro Glu His Arg 
100 105 no 

Ala Leu Leu Asn Ala Phe Thr Ala Thr Phe GLu Leu Ser Asp Asp Gly 
liS 120 125 

Arg Phe Glu Val Trp Asn Ala Met Thr Arg Gly Gin His Cys Asp Thr 
!30 135 140 
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Trp Ser Tyr Val Leu lie Pro Ala Ala Gin Pro Gly Gin Phe Thr Val 
145 150 155 160 

Asp His Gly Val Gly Arg Ser Trp Leu Leu Pro Pro Gly Thr Leu Asp 
155 170 175 

Gin Phe lie Cys Leu Gly Arg Ala Gin Gly Leu Ser Asp Asp Asn He 
180 185 190 

Val Phe Pro Asp Val Thr Gly Xaa Ala Leu Aso Leu Xaa Ser Leu Pro 
195 200 205 

Trp Val Ala Ala Pro Ala 
210 



<210> 327 A . 
<211> 1B1 
<212> PRT 

<213> Homo sapiens 
<400> 327 

Met Cys Val Cys glu Arg Lys Arg Gly Arg Glu Lys Glu Gly Gly Val 
1 5 10 15 

Thr Pro Thr Met Thr Ser Asn Phe Pro Phe Cys Thr Leu He Leu Gly 
20 25 30 

He Ala Gin Ala Gin Ala Cys Pro Gly Cys Pro Gly Asp Trp Pro Gly 
35 40 45 

Leu Gly Ser Gly Val Gly Glu Gly Leu His His He Arg Thr Cys Arg 
50 55 60 

Thr Pro He Pro Cys Ser Pro Pro Ala Pro Ala Ala Ala Cys Leu Gly 
65 70 75 80 

Ser Gly His Ala Arg Leu Pro Cys Val Leu Arg Leu Trp Pro Val Pro 
85 90 95 

Ala Asn Leu Ser Ser Pro Phe Arg ueu Glu Ala Leu His Cys Ser Phe 
100 105 HO 

Trp Ser Ser Pro Leu Leu Pro Ala Pro His Leu Ala Phe Phe Gly Phe 
115 120 125 

Arg Asp Leu Leu Thr Asp Phe Leu Leu Ala Ala Cys Leu Leu Thr Phe 
130 135 140 

Gin Lys Thr Pro Leu Glu Leu Pro Met Ala Val Val His Leu Leu Val 
145 150 • 155 160 

Ala Thr Pro Cys Tyr Gin Met Leu Asp Asn Leu Pro Leu Pro Ser Ala 
165 170 175 

Ala Ala Asn Trp Cys 
1B0 
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<210> 328 

<211> 195 

<212> PRT 

<213> Homo sapiens 

<400> 32B 

Tyr Leu Trp Gly Arg Pro Arg Leu Arg Mat Arg Ala Gly Thr Ser P-o 
1 S 10 15 

Ser Ala Pro Trp Gly Glu Lys Arg Glu Lys Leu Gly His Lys Leu Pro 
20 25 30 

Val Ala Leu Gin Gly Tyr His Pro Trp He Leu Leu Glu Cys Thx Val 
35 40 45 

Phe Trp Ala Arg Val Val Leu Ala Cys Phe Ser Leu Tyr Leu He Arg 
50 -55 60 

Gly pro Asn Cys He Asn Arg Gin Pro Glu Pro Thr Tyr Gin Lys Ala 
65 70 75 ' fio 

Cys Asn Leu Asp Cys Scr Ser Asp Phe Gly Gin Glu Arg Ala Pro Ala 
85 90 95 

Trp Glu Leu Leu Gly Pro Glu Ser Glu Gin Arg Leu Arg Glu Tyr Thr 
100 105 no 

Ala Gin Gly Leu Gin Ser Leu Ala Ser Ser His Arg Trp Arg Gin Phe 
115 120 125 

Lys Thr Glu Gly Lys Met Arg Gly Gly Ala Ser Pro Leu Pro Trp Leu 
130 135 140 

He Cys Phe Trp Leu Cys Ser Tyr Lys Gly Ser Asp Asn Ser Leu Lys 
145 150 155 160 

Pro Val Val Pro Gly Pro Thr Leu Cys Pro Gin Ser Leu Val Ser Pro 
165 170 175 

Ser Val His Pro Ser Thr Arg Ser Ala Ser Leu Gly Arg His Arg Ala 
180 185 190 

Glu Ala Ala 
195 



<210> 329 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 329 

Met Pro Gly lie Leu Ala Gly He 
1 5 

Leu Leu Gin Gly Phe Arg Leu Leu 
20 

Leu Ser Gly Trp Met Gly Gly Gin 



Pro Val Lys Asp Leu Cys Leu Ser 
10 15 

Leu Leu Cy3 Val Cya Pro Gly Trp 
25 30 

Lys Gly Ser Pro Arg He Val Asp 
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35 



40 



45 



lie Gly 
50 



<210> 330 

<211> 90 

<212> PRT 

<213> Homo sapiens 

<400> 330 

Ala Lys Gly Glu Glu Arg Lys Glu Ala Phe Ser Leu Lys Met Val Gin 
1 5 .10 15 

Leu Ser Ser Glu Pro lie Ser Phe Gly Leu Met: Tyr Leu Tyr Leu Gly 
20 j 25 30 

val Phe Phe His Leu lie Tyr Pro Gly Ala Leu Ser He Thr Thr Leu 
35 40 45 

Gly Lys His Ser His Pro Phe Phe Thr Ala Glu Gin Asa Ser Thr Val 
50 55 60 



Trp Met Glu His Thr Leu Phe His Gin Ser Pro Val Ala Ser His Leu 

65 70 75 80 

Val Cys Phe Gin Ser Flie Ala Phe Ser Glu 
85 90 



<210> 331 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 331 

Gly Pro Ala His Pro Ala Ser Pro Pro Leu Met Thr Leu Ser Leu Gin 
1 5 10 15 

Leu Ala Glu Leu Val His Phe Val Cys Ala Phe Gin Ser Gin Trp Thr 
20 25 30 

Gly Val Tyr Pro Met Met Pro Pro Leu Lys Pro Thr Glu Pro Leu Cys 
35 40 45 

Phe Ala Cys Val Pro Cys Arg Val 
50 55 



<210> 332 

<211> IB 

<212> P3T 

<213> Homo sapiens 



<400> 332 

Met Leu Leu Glu Val Tyr Gly Asp Ser He Ser Val Thr Val Ala He 
1 5 10 15 
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Pro Leu 



<210> 333 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 333 

Met His Ser Pro Cys Gin Ser Lys Ala Ala Asp Gly Leu Gly Lys Ser 
1 5 10 15 

Glu Thr Glu 



<210> 334 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 334 

Met Leu Lys Ser Leu Gly Leu Ser Thr Asn 
1 5 10 



<210> 335 
<211> 200 
<212> PRT 
<213> Homo sapiens 

<400> 335 

Ala Gin Arg Leu Ala Glu Glu Cys Phe Tyr Met Leu Leu Glu Val Tyr 
1 5 10 15 

Gly Asp Ser He Ser Val Thr Val Ala He Pro Leu Met His Ser Fro 
20 25 30 

Cys Gin Ser Lys Ala Ala Asp Gly Leu Gly Lys Ser Glu Thr Glu Met 
35 40 45 

Leu Lys Ser Leu Gly Leu Ser Thr Asn Met Ser Pro Phe His Leu Leu 
50 55 60 

Gly Leu Lys Val Phe Leu Thr Trp Ala Leu Thr Leu Ala Gin He Cys 
65 70 75 80 

Leu Tyr Phe Phe Glu Val Gin Pro Leu Gly Leu Leu Ala Leu Asn Phe 
85 90 95 

Phe Cys Thr Ala Thr Ala Gly Leu Lys Glu Leu Cys Met His Pro Pro 
100 105 HO 

Ser Leu Ala Phe Thr Pro Glu Phe His Thr Ser Leu Ser Pro Leu Ala 
115 120 125 

He Pro Ser Phe Cys Gly Thr Ser Val Ser Leu Ser Asn Ser His Thr 
130 135 140 
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He Pro Leu Ser Leu Tyr Leu Pro Phe Pro Ser Lys Ser Arg Met Pro 
145 ISO 155 160 

Asp Thr Leu His Leu Leu Vai His Ser Leu Pro Leu Val His Ser Gin 
165 170 175 

Val Leu Pro Val Lys Asp Val Thr He Glu Trp Pro Leu Cys Gin Arg 
180 185 190 

Cys Leu Gly Ser Thr Cys His Gin 





195 200 




<210> 


336 




<211> 


99 




<212> 


PRT ~- 




<213> 


Homo sapiens 




<220> 






<221> 


SITE 




<222> 


(94) 




<223> 


Xaa equals any of the naturally occurring 


L- amino 


<220> 






<221> 


SITE 




<222> 


(99) 




<223> 


Xaa equals any of the naturally occurring 


L -amino 


<40O> 


336 




Trp He Pro Arg Ala Ala Gly He Arg His Glu Val 


Gin Val 


1 


5 10 





15 

Phe Gin Met Phe cys Phe Ser Ser He Phe Cys Ser His Glu His Thr 
20 25 30 

His Leu Pro Gly Thr Phe Trp Leu Pho Leu Phe Leu Phe Leu He Leu 
35 40 45 

Pro Pro Ser Cys Pro Cys Phe Leu Pro Phe Ser Leu Ala He Glu Thr 
50 55 60 

Val Arg Trp Pro Cys Trp His His Pro Thr Ser Phe Glu Leu Cys Tyr 
65 70 7b 80 

Pro Gly Thr Ser He Tyr Tyr Ala Ser Arg Gly Gly Pro Xaa Pro Asn 
85 90 95 

Ser Glu Xaa 



<210> 337 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
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<222> (1} 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (3) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 337 

Xaa Asn Xaa Lys Ser Pro leu Thr He Gly Asn Lys Ser Trp Ser Ser 
15 10 is 

Thr Ala Val Ala Ala Ala Leu Glu Leu Val Asp Pro Pro Gly Cys Arg 
20 25 . 30 

< Asn Ser Ala Arg Asp Ser Pro Glu Leu Val His Leu Gly Lys Gly Arg 
35 / 40 45 

Pro Arg Lys Leu Met Thr Tyr Leu Phe Cys Ser Ser He Ser Leu Leu 
50 55 eo 

Leu Leu Lys Val His Ser Scr Gly His Gin Asp He Arg Lys Ala Lys 
65 70 75 80 

Ser Lys Val Pro Arg Leu Leu He lie Gin Cys Pro Gin Gin Arg Glu 
85 90 9S 



<210> 338 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Gly pro Glu Glu Asn Leu Ser Pro Ser Thr Pro Ser Gin Met Pro Thr 
1 5 10 15 

He Trp Val Lys Leu Cys Leu Leu Gin Val Cys His Gly Leu Phe Pro 
20 25 30 

Leu Leu Lys His Trp Ser Gin Pro Met Pro Leu Cys Val Thr Leu Ala 
35 40 45 

Pro Val Ser Tyr Trp Leu 
50 



<210> 339 
<211> 287 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Pro Arg Val Arg Lys Glu Pro Glu Ala Met Gin Trp Leu Arg Val Arg 
5 10 15 
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Clu Ser Pro Gly Glu Ala Thr Gly His Arg Val Thr Met Gly Thr Ala 
20 25 30 

Ala Leu Giy Pro Val Trp Ala Ala Leu Leu Leu Phe Leu Leu Met Cys 
35 40 45 

Glu He Pro Ket Val Glu Leu Thr Phe Asp Arg Ala Val Ala Ser Asp 
50 55 60 

Cys Gin Arg Cys Cys Asp Ser Glu Asp Pro Leu Asp Pro Ala His Val 
65 70 75 80 

Ser Ser Ala Ser Ser Ser Gly Arg Pro His Ala Leu Pro Glu He Arg 
85 . 90 95 

Pro Tyr Tie Asn He Thr He Leu Lys Gly Asp Lys Gly Asp Pro Gly 
100 r 105 no 

?ro Met Gly Leu Pro Gly Tyr Met Gly Arg Glu Gly Pro Gin Gly Glu 
115 120 125 

Pro Gly Pro Gin Gly Ser Lys Gly Asp Lys Gly Glu Met Gly Ser Pro 
!30 135 140 

Gly Ala Pro Cys Gin Lys Arg Phe Phe Ala Phe Ser Val Gly Arg Lys 
145 ISO 155 160 

Thr Ala Leu His Ser Gly Glu Asp Phe Gin Thr Leu Leu Phe Glu Arg 
165 170 175 

Val Phe Val Asn Leu Asp Gly Cys Phe Asp Met Ala Thr Gly Gin Phe 
180 185 190 

Ala Ala Pro Leu Arg Gly He Tyr Phe Phe Ser Leu Asn Val His Ser 
195 200 205 

Trp Asn Tyr Lys Glu Thr Tyr Val His He Met His Asn Gin Lys Glu 
210 215 220 

Ala Val He Leu Tyr Ala Gin Pro Ser Glu Arg Ser He Met Gin Ser 
225 230 235 240 

Gin Ser Val Met . Leu Asp Leu Ala Tyr Gly Asp Arg Val Trp Val Arg 
245 250 255 

Leu Phe Lys Arg Gin Arg Glu Asn Ala He Tyr Ser Asn Asp Phe Asp 
260 265 270 

Thr Tyr He Thr Phe Ser Gly His Leu He Lys Ala Glu Asp Asp 
275 2R0 2B5 



<210> 340 
<211> 339 
<212> PRT 

<213> Homo sapiens 



<400> 340 

Met Leu Tyr Pro Gly Ser Val Tyr Leu Leu Gin Lys Ala Leu Met Pro 
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1 5 10 15 

Val Leu Leu Gin Gly Gin Ala Arg Leu Val Glu Glu Cys Asn Gly Arg 
20 25 30 

Arg Ala Lys Leu Leu Ala Cys Asp Gly Asn Glu lie Asp Thr Met Phe 
35 40 45 

Val Asp Arg Arg Gly Thr Ala Glu Pro Gin Gly Gin Lvs Leu Val lie 
SO 55 60 

Cys Cys Glu Gly Asn Ala Gly Phe Tyr Glu Val Gly Cys Val Ser Thr 
65 70 75 80 

Pro Leu Glu Ala Gly Tyr Ser Val Leu Gly Trp Asn His Pro Gly ^he 
85 90 95 

? 

Ala Gly Ser Thr Gly val Pro Phe Pro Gin Asn Glu Ala Asn Ala Met 
100 105 no 

Asp Val Val Val Gin Phe Ala lie His Arg Leu Gly Phe Gin Pro Gin 
115 120 125 

Asp He He He Tyr Ala Trp Ser He Gly Gly Phe Thr Ala Thr Trp 
130 135 140 

Ala Ala Met Ser Tyr Pro Asp Val Ser Ala MeL He Leu Asp Ala Ser 
145 ISO 155 160 

Phe Asp Asp Leu Val Pro Leu Ala Leu Lys Val Met Pro Asp Ser Trp 
165 170 175 

Arg Gly Leu Val Tfcr Axg Thr Val Arg Gin His Leu Asn Leu Asn Asn 
180 185 190 

Ala Glu Gin Leu Cys Arg Tyr Gin Gly Pro Val Leu Leu He Arg Arg 
195 200 205 

Thr Lys Asp Glu He He Thr Thr Thr Val Pro Glu Asd He Met Ser 
210 215 220 

Asn Arg Gly Asn Asp Leu Leu Leu Lys Leu Leu Gin His Arg Tyr Pro 
225 230 235 240 

Arg Val Met Ala Glu Glu Gly Leu Arg Val Val Arg Gin Trp Leu Glu 
245 250 25S 

Ala Ser Ser Gin Leu Glu Glu Ala Ser He Tyr Ser Arg Trp Glu Val 
260 265 270 

Glu Glu Asp Trp Cys Leu Ser Val Leu Arg Ser Tyr Gin Ala Glu Kis 
2? 5 280 285 

Gly Pro Asp Phe Pro Trp Ser Val Gly Glu Asp Met Ser Ala Asp Gly 
290 295 300 

Arg Arg Gin Leu Ala Leu Phe Leu Ala Arg Lys His Leu His Asn Phe 
305 310 315 320 
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Glu Ala Thr His Cys Thx Pro Leu Pro Ala Gin Asn ?he Gin Met Pro 
325 330 335 • 

Trp His Leu 



<210> 341 
<211> 127 
<212> PRT 
<■ 2 1 3 > Homo sapiens 

<400> 341 

Val Cys Pro Lys Trp Cys Arg Phe Leu. Thr Met Leu Gly His Cys Cys 
15 10 15 

Tyr Phe Trp Gin Val Trp Pro Ala Ser Glu Ala Leu Ala Ala Gly Pro 
20 25 30 

Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His He Gly 
35 40 45 

Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr Leu Thr 
50 55 60 

Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala Ala Gly 
65 70 75 80 

Arg Ser Leu Glu Gly Ala Leu Pro Ala Gly Val Trp Pro Cys Phe Ala 
B5 90 95 

Gin Ser Pro Cys Thr Gly Gly Gin Gin Thr Pro Ser Ser Thr Gly Leu 
iOO 105 HO 

Arg Ser Cys Leu Val Arg Ser Pro Ala Thr Trp Trp Arg Thr Pro 
21 5 120 125 



<210> 342 
<211> 554 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (16) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (109) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 342 

Trp lie Pro Arg Ala Ala Gly lie Arg His Glu lie Tyr Arg Glu Xaa 
15 10 15 

Asp Ser Glu Arg Ala Pro Ala Ser Val Pro Glu Thr Pro Thr Ala Val 
20 25 30 
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Thr Ala Pro His Ser Ser Ser Trp Asp Thr Tyr Tyr Gin Pro Arg Ala 
35 40 45 

Leu Glu Lys His Aia Asp Ser He Leu Ala Leu Ala Ser VaL Phe Trp 
50 55 60 

Ser He Ser Tyr Tyr Ser Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg 
65 . 70 75 . BO 

Lys Gly Tyr Leu Ser Leu Ser Lys Val Val Pro Phe Ser His Tyr Ala 
85 90 95 

Gly Thr Leu Leu Leu Leu Leu Ala Gly . Val Ala Cys Xaa Arg Gly He 
100 105 no 

Gly Arg Trp Thr Asn Pro/'Gln Tyr Arg Gin Pha He Thr He Leu Glu 
US 120 125 

Ala Thr His Arg Asn Gin Ser Ser Glu Asn Lys Arg Gin Leu Ala Asn 
130 135 140 

Tyr Asn Phe Asp Phe Arg Ser Trp Pro Val Asp Phe His Trp Glu Glu 
145 150 155 160 

Pro Ser Ser Arg Lys Glu Ser Arg Gly Gly Pro Ser Arg Arg Gly Val 
165 170 175 

Ala Leu Leu Arg Pro Glu Pro Leu His Arg Gly Thr Ala Asp Thr Leu 
180 185 190 

Leu Asn Arg Val Lys Lys Leu Pro Cys Gin He Thr Ser Tyr Leu Val 
195 200 205 

Ala His Thr Leu Gly Arg Arg Met Leu Tyr Pro Gly Ser Val Tyr Leu 
210 215 220 

Leu Gin Lys Ala Leu Met Pro Val Leu Leu Gin Gly Gin Ala Arg Leu 
225 230 235 240 

Val Glu Glu Cys Asn Gly Arg Arg Ala Lys Leu Leu Ala Cys Asp Gly 
245 250 255 

Asn Glu He Asp Thr Met Phe Val Asp Arg Arg Gly Thr Ala Glu Pro 
260 265 270 

Gin Gly Gin Lys Leu Val He Cys Cys Glu Gly Asn Ala Gly Phe Tyr 
2? 5 280 285 

Glu Val Gly Cys Val Ser Thr Pro Leu Glu Ala Gly Tyr Ser Val Leu 
290 295 300 

Gly Trp Asn His Pro Gly Phe Ala Gly Ser Thr Gly Val Pro Phe Pro 
305 310 315 320 

Gin Asn Glu Ala Asn Ala Met Asp Val Val Val Gin Phe Ala He Mis 
325 330 335 

Arg Leu Gly Phe Glr. Pro Gin Asp lie He lie Tyr Ala Trp Ser lie 



WO 00/06698 



229 



PCT/US99/17I30 



340 345 350 

Gly Gly Phe Thr Ala Thr Trp Ala Ala Met Ser Tyr Pro Asp Val Ser 
355 360 365 

Ala Met He Leu Asp Ala Ser Phe Asp Asp Leu Val Pro Leu Ala r.eu 
370 375 380 

Lys Val Met Pro Asp Ser Trp Arg Gly Leu val Thr Arg Thr val Arg 
385 390 395 400 

G In His Leu Asn Leu Asn Asn Ala Glu Gin Leu Cys Arg Tyr Gin Gly 
405 410 415 

Pro Val Leu Leu He Arg Arg Thr Lys Asp Glu He He Thr Thr Thr 
420 425 430 

$ 

Val Pro Glu Asp He Met Ser Asn Arg Gly Asn Asp Leu Leu Leu Lys 
435 440 445 

Leu Leu Gin His Arg Tyr Pro Arg Val Met Ala Glu Glu Gly Leu Arg 
450 455 460 

Val Val Arg Gin Trp Leu Glu Ala Ser Ser Gin Leu Glu Glu Ala Ser 
465 470 475 4B0 

He Tyr Ser Arg Trp Glu Val Clu Clu Asp Trp Cys Leu Ser Val Leu 
485 490 495 

Arg Ser Tyr Gin Ala Glu His Gly Pro Asp Phe Pro Trp Ser Val Gly 
500 505 510 

Glu Asp Met Ser Ala Asp Gly Arg Arg Gin Leu Ala Leu Phe Leu Ala 
515 520 525 

Arg Lys His Leu His Asn Phe Glu Ala Thr His Cys Thr Pro Leu Pro 
530 535 540 

Ala Gin Asn Phe Gin Met Pro Trp His Leu 
545 550 



<210> 343 
<211> 225 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (5) 

<223> Xaa equals any of the naturally occurring L-araino acids 
<40O> 343 

His Glu Arg Ala Xaa Gly Pro Ser Arg Gly His Gly Glu Leu Leu Ser 
1 5 10 15 

Cys Val Leu Gly Pro Arg Leu Tyr Lys He Tyr Arg Glu Arg Asp Ser 
20 25 30 
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Glu Arg Ala Pro Ala Ser Val Pro Glu Thr Pro Thr Ala Val "hr Ala 
35 40 45 

Pro His Ser Ser Ser Trp Asp Thr Tyr Tyr Gin Pro Arg Ala Leu Glu 
50 55 60 

Lys His Ala Asp Ser He Leu Ala Leu Ala Ser Val Phe Trp Ser He 
65 ™ 75 80 

Ser Tyr Tyr Ser Ser Pro Phe Ala Phe Phe Tyr Leu Tyr Arg Lys Gly 
85 90 95 

Tyr Leu Ser Leu Ser Lys Val Val Pro Phe Ser His Tyr Ala Gly Thr 
100 105. no 

Leu Leu Leu Leu Leu Ala Gly Val Ala Cys Ser Glu Ala Leu Ala Ala 
115 " 120 125 

Gly Pro Thr Pro Ser Thr Gly Ser Ser Ser Pro Ser Trp Lys Gin His 
130 135 140 

lie Gly Thr Ser Leu Gin Lys Thr Arg Gly Ser Leu Pro Thr Thr Thr 
145 150 155 l60 

Leu Thr Ser Gly Ala Gly Gin Ser Thr Ser Thr Gly Lys Asn Pro Ala 
165 170 i7 5 

Ala Gly Arg Ser Leu Glu Gly Ala Leu Pro Ala Gly Val Trp Pro Cys 
180 185 190 

Phe Ala Gin Ser Pro Cys Thr Gly Gly Gin Gin Thr Pro Ser Ser Thr 
195 200 205 

Gly Leu Arg Ser Cys Leu val Arg Ser Pro Ala Thr Trp Trp Arg «hr 
210 215 220 

Pro 
225 



<210> 344 
<211> 299 
<212> PRT 

<213> Homo sapiens 
<400> 344 

Met Phe Lys Arg His Gin Arg Leu Lys Lys Asp Ser Thr Gin Ala Glu 
. 1 5 10 15 

Glu Asp Leu Ser Glu Gin Glu Gin Asn Gin Leu Asn Val Leu Lys Lys 
20 25 30 

His Gly Tyr Val Val Gly Arg val Gly Arg Thr Phe Leu Tyr Ser Glu 
35 40 45 

Glu Gin Lys Asp Asn He Pro Phe Glu Phe Asp Ala Asp Ser Lftu Ala 
50 . 55 6 o 

Phe Asp Met Glu Asn Asp Pro Val Met Gly Thr His Lys Ser Thr Lys 
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65 70 75 80 

Gin Val Glu Leu Thr Ala Gin Asp Val Lys Asp Ala His Trp Phe Tyr 
85 90 95 

Asp Thr Pro Gly He Thr Lys Glu Asn Cys He Leu Asn Leu r,eu Thr 
100 105 no 

Glu Lys Glu Val Asn lie val Leu Pro Thr Gin Ser lie Val Pro Arg 
115 120 125 

Thr Phe Val Leu Lys Pro Gly Met Val Leu Phe Leu Gly Ala He Gly 
130 135 140 

Arg He Asp Phe Leu Gin Gly Asn Gin Ser Ala Trp Phe Thr Val Val 
145 150 155 160 

Ala Ser Asn He Leu Pro Val His He Thr Ser Leu Asp Arg Ala Asp 
165 170 175 

Ala Leu Tyr Gin Lys His Ala Gly His Thr Leu Leu Gin He Pro Met 
180 185 190 

Gly Gly Lys Glu Arg Met Ala Gly Phe Pro Pro Leu Val Ala Glu Asp 
195 200 205 

Ha Met Leu Lys Glu Gly Leu Gly Ala Ser Glu Ala Val Ala Asp He 
210 215 220 

Lys Phe Ser Ser Ala Gly Trp Val Ser Val Thr Pro Asn Phe Lys Asp 
225 230 235 240 

Arg Leu His Leu Arg Gly Tyr Thr Pro Glu Gly Thr Val Leu Thr Val 
245 250 255 

Arg Pro Pro Leu Leu Pro Tyr He Val Asn He Lys Gly Gin Arg He 
260 265 270 

Lys Lys Ser Val Ala Tyr Lys Thr Lys Lys Pro Pro Ser Leu Met Tyr 
275 280 285 

Asn Val Arg Lys Lys Lys Gly Lys He Asn Val 
290 295 



<210> 345 
<211> 314 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (147) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (211) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<400> 34S 

Met Leu Pro Ala Arg Leu Pro Phe Arg Leu Leu Ser Leu Phe Leu Arg 
1 5 io 15 

Gly Ser Ala Pro Thr Ala Ala Arg His Gly Leu Arg Glu Pro Leu Leu 
20 25 30 

Glu Arg Arg Cys Ala Ala Ala Ser Ser Phe Gin His Ser Ser Ser Leu 
35 40 45 

Gly Arg Glu Leu Pro Tyr Asp Pro Val Asp Thr Glu Gly Phe Gly Glu 
5 ° 55 60 

Gly Gly Asp Met Gin Glu Arg Phe Leu Phe Pro Glu Tyr lie Leu Asp 
65 70 75 so 

f 

Pro Glu Pro Gin Pro Thx Arg Glu Lys Gin Leu Gin Glu Leu Gin Gin 
85 90 95 

Gin Gin Glu Glu Glu Glu Arg Gin Arg Gin Gin Arg Arg Glu Glu Arg 
100 105 no 

Arg Gin Gin Asn Leu Arg Ala Arg Ser Arg Glu His Pro Val Val Gly 
115 120 125 

His Pro Asp Pro Ala Leu Pro Pro Ser Gly Val Asn Cys Ser Gly Cys 
130 135 140 

Gly Ala Xaa Leu His Cys Gin Asp Ala Gly Val Pro Gly Tyr Leu Pro 
14S 150 155 160 

Arg Glu Lys Phe Leu Arg Thr Ala Glu Ala Asp Gly Gly Leu Ala Arg 
165 170 175 

Thr Val Cys Gin Arg Cys Trp Leu Leu Ser His His Arg Arg Ala Leu 
18 ° 185 190 

Arg Leu Gin Val Ser Arg Glu Gin Tyr Leu Glu Leu Val Ser Ala Aid 
195 200 205 

Leu Arg Xaa Pro Gly Pro Ser Leu Val Leu Tyr Met Val Asp Leu Leu 
210 215 220 

Asp Leu Pro Asp Ala Leu Leu Pro Asp Leu Pro Ala Leu Val Gly Pro 
225 230 235 240 

Lys Gin Lgu lie Val Leu Gly Asn Lys Val Asp Leu Leu Pro Gin Asp 
245. 250 255 

Ala Pro Gly Tyr Arg Gin Arg Leu Arg Glu Arg Leu Trp Glu Asp Cys 
260 265 270 

Ala Arg Ala Gly Leu Leu Leu Ala Pro Gly Thr Lys Gly His Ser Ala 
2? 5 280 285 

Pro Ser Arg Thr Sar His Arg Thr Gly Arg lie Arg He Arg Arg Thr 
290 295 300 
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Gly pro Ala Gin Trp Ser Gly Thr Cys Gly 

305 310 , 

<210> 346 
<211> 380 
<212> PRT 

<213> Homo sapiens 
<400> 346 

Pro Ser Phe Arg Arg Glu Arg Val Glu Thr Gly Gly Gly Gly Pro Val 
1 5 10 15 

Thr His Gly Tar Glu Gly Pro Phe Leu Pro Leu Pro Gly Gly Thr Arg 
20 25 30 

Met Asn Met Thr Gin AI41 Arg Val Leu Val Ala Ala Val Val Gly Leu 
35 40 45 

Val Ala Val Leu Leu Tyr Ala Ser lie His Lys lie Glu Glu Gly His 

50 S5 GO ^ 

Leu Ala Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr ser Pro Ser Gly 
55 70 75 eo 

Pro Gly Tyr His He Met Leu Pro Phe lie Thr Thr Phe Arg Ser Val 
85 90 95 

Gin Thr Thr Leu Gin Thr Asp Glu Val Lys Asr. Val Pro Cys Gly Thr 
10O 105 no 

Ser Gly Gly Val Met lie Tyr lie Asp Arg He Glu Val Val Asn Met 
115 120 125 

Leu Ala Pro Tyr Ala Val Phe Asp lie Val Arg Asn Tyr Thr Ala A^p 
130 135 140 

Tyr Asp Lys Thx Leu lie Phe Asn Lys He His His Glu Leu Asn Gin 
145 150 155 160 

Phe Cys Ser Ala His Thr Leu Gin Glu Val Tyr He Glu Leu Phe Asp 
165 170 175 

Gin He Asp Glu Asn Leu Lys Gin Ala Leu Gin Lys Asp Leu Asn Leu 
180 185 190 

Met Ala Pro Gly Leu Thr He Gin Ala Val Arg Val Thr Lys Pro Lys 
195 200 20b 

He Pro Glu Ala lie Arg Arg Asn Phe Glu Leu Met Glu Ala Glu Lvs 
210 215 220 

Thr Lys Leu Leu He Ala Ala Gin Lys Gin Lys Val Val Glu Lys Glu 
225 230 235 240 

Ala Glu Thr Glu Arg Lys Lys Ala Val He Glu Ala Glu Lys He Ala 
245 250 255 

Gin Val Ala Lys He Arg Phe Gin Gin Lys Val Met Glu Lys Glu Thr 
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260 265 270 

Glu Lys Arg lie Ser Glu lie Glu Asp Ala Ala Phe Leu Ala Arg Glu 
275 280 285 

Lys Ala Lys Ala Asp Ala Glu Tyr Tyr Ala Ala His Lys Tyr Ala Thr 
290 295 300 

Ser Asn Lys His Lys Leu Thr Pro Glu Tyr Leu Glu Leu Lys Lys Tyr 
305 310 315 320 

Gin Ala lie Ala Ser Asn Ser Lys lie Tyr Phe Gly Ser Asn lie Pro 
325 330 335 

Asn Met Phe Val Asp Ser Ser Cys Ala Leu Lys Tyr Ser Asp lie Arg 
340 345 350 

Thr Gly Arg Glu Ser Ser Leu Pro Ser Lys Glu Ala Leu Glu Pro Ser 
355 360 365 

Gly Glu Asn Val lie Gin Asn Lys Glu Ser Thr Gly 
370 375 380 



<210> 347 
<211> 422 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (328) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 347 

Trp Ser Thr Gly Asn Ala Ser Trp Glu Lys Lys Asp Asn Phe lie Leu 
15 10 15 

Ser Ala Asp Phe Glu Met Met Gly Leu Gly Asn Gly Arg Arc Ser Met 
20 25 30 

Lys Ser Pro Pro Leu Val Leu Ala Ala Leu Val Ala Cys lie He Val 
35 40 45 

Leu Gly Phe Asn Tyr Trp He Ala Ser Ser Arg Ser Val Asp Leu Gin 
50 55 60 

Thr Arg He Met Glu Leu Glu Gly Arg Val Arg Arg Arg Ala Ala Glu 
65 70 75 80 

Arg Gly Ala Val Glu Leu Lys Lys Asn Glu Phe Cln Gly Glu Leu Glu 
85 90 95 

Lys Gin Arg Glu Gin Leu Asp Lys tie Gin Ser Ser His Asn Phe Gin 
100 105 no 

Leu Glu Ser Val Asn Lys Leu Tyr Gin Asp Glu Lys Ala Val Leu Val 
115 120 125 
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Asn Asn lie Thr Thr Gly Glu Arg Leu lie Arg Val Leu Gin Asp Gin 
130 135 140 

Leu Lys Thr Leu Gin Arg Asn Tyr Gly Arg Leu Gin Gin Asp Val Leu 
-45 150 155 160 

Gin Phe Gin Lys Asn Gin Thr Asn Leu Glu Arg Lys Phe Ser Tyr Asp 
165 170 175 

Leu Ser Gin Cys He Asn Gin Met Lys Glu Val Lys Glu Gin Cys Glu 
180 185 190 

Glu Arg He Glu Glu Val Thr Lys Lys Gly Asn Glu Ala Val Ala Ser 
195 200 205 

Arg Asp Leu Ser Glu Asn Asn Asp Gin Arg Gin Gin Leu Gin Ala Leu 
210 2fs 220 

Ser Glu Pro Gin Pro Arg Leu Gin Ala Ala Gly Leu Pro His Thr Glu 
225 230 235 240 

Val Pro Gin Gly Lys Gly Asn Val Leu Gly Asn Ser Lys Ser Gin Thr 
245 250 255 

Pro Ala Pro Ser Ser Glu Val Val Leu Asp Ser Lys Arg Gin Val Glu 
260 265 270 

Lys Glu Glu Thr Asn Glu He Gin Val Val Asn Glu Glu Pro Gin Arg 
275 280 285 

Asp Arg Leu Pro Gin Glu Pro Gly Arg Glu Gin Val Val Glu Asp Arg 
290 295 300 

Pro Val Gly Gly Arg Gly Phe Gly Gly Ala Gly Glu Leu Gly Gin Thr 
305 310 315 320 

Pro Gin Val Gin Ala Ala Leu Xaa Val Ser Glr. Glu Asn Pro Glu Met 
325 330 335 

Glu Gly Pro Glu Arg Asp Gin Leu Val lie Pro Asp Gly Gin Glu Glu 
340 345 350 

Glu Gin Glu Ala Ala Gly Glu Gly Arg Asn Gin Gin Lys Leu Arg Gly 
355 360 365 

Glu Asp Asp Tyr Asn Met Asp Glu Asn Glu Ala Glu Ser Glu Thr Asp 
370 375 380 

Lys Gin Ala Ala Leu Ala Gly Asn Asp Arg Asn lie Asp Val Phe Asn 
305 390 395 400 

Val Glu Asp Gin Lys Arg Asp Thr lie Asn ^eu Leu Asp Gin Arg Glu 
405 410 415 

Lys Arg Asn His Thr Leu 
420 



<210> 346 



WO 00/06698 



PCT/US99/17130 



236 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 348 

Ser Leu His Arg Phe Val Leu Ser Gin Ala Lys Asp Glu Leu 
15 10 



<210> 349 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 349 

Phe He Lys Phe Phe Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala 
1 5 *" 10 15 

Pro Thr Trp 



<210> 350 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 350 

Phe He Lys Phe Tyr Ala Pro Trp Cys Gly His Cys Lys Thr Leu Ala 

1 ■ 5 10 :s 

Pro Thr Trp 



<210> 351 
<211> 363 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 

<222> (42) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<400> 351 

Arg Arg Gly Arg Gly Val Pro Gly Pro Arg Gly Arg Arg Arg Leu Trp 
1 5 10 15 

Ser Ala Ala Cys Gly His Cys Gin Arg Leu Gin Pro Thr Trp Asn Asp 

20 25 30 \ 

Leu Gly Asp Lys Tyr Asn Ser Met Glu Xaa Ala Lys Val Tyr Val Ala 
35 40 45 

Lys Val Asp Cys Thr Ala His Ser Asp Val Cys Ser Ala Gin Gly Val 
50 55 60 

Arg Gly Tyr Pro Thr Leu Lys Leu Phe Lys Pro Gly Gin Glu Ala Val 
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65 70 75 80 

Lys Tyr Gin Gly Pro Arg Asp Phe Gin Thr Leu Glu Asn Trp Met Leu 
85 90 95 

Gin Thr Leu Asn Glu Glu Pro Val Thr Pro Glu Pro Glu Val Glu Pro 
100 105 110 

Pro Ser Ala Pro Glu Leu Lys Gin Gly Leu Tyr Glu Leu Ser Ala Ser 
115 120 125 

Asn Phe Glu Leu His Val Ala Gin Gly Asp His Phe lie Lys Phe Phe 
130 135 140 

Ala Pro Trp Cys Gly His Cys Lys Ala Leu Ala Pro Thr Trp Glu Gin 
145 150 155 160 

Leu Ala Leu Gly Leu g!u His Ser Glu Thr Val Lys lie Gly Lys Val 
165 170 175 

Asp Cys Thr Gin His Tyr Glu Leu Cys Ser Gly Asn Gin Val Arg Gly 
180 185 . 190 

Tyr Pro Thr Leu Leu Trp Phe Arg Asp Gly Lys Lys Val Asp Gin Tyr 
195 200 205 

Lys Gly Lys Axg Asp Leu Glu Ser Leu Arg Glu Tyr Val Glu Ser Gin 
210 215 220 

Leu Gin Arg Thr Glu Thr Gly Ala Thr Glu Thr Val Thr Pro Ser Glu 
225 230 235 240 

Ala Pro val Leu Ala Ala Glu Pro Glu Ala Asp Lys Gly Thr Val Leu 
245 250 255 

Ala Leu Thr Glu Asn Asn Phe Asp Asp Thr lie Ala Glu Gly lie Thr 
260 265 270 

Phe He Lys Phe Tyr Ala Pro Trp cys Gly His Cys Lys Thr Leu Ala 
275 2R0 285 

Pro Thr Trp Glu Glu Leu Ser Lys Lys Glu Phe Pro Gly Leu Ala Gly 
290 295 300 

Val Lys lie Ala Glu Val Asp Cys Thr Ala Glu Arg Asn lie Cys Ser 
305 310 315 320 

Lys Tyr Ser Val Arg Gly Tyr Pro Thr Leu Leu Leu Phe Arg Gly Gly 
325 330 335 

Lys Lys Val Ser Glu His Ser Gly Gly Arg Asp Leu Asp Ser Leu His 
340 345 350 

Arg Phe Val Leu Ser Gin Ala Lys Asp Glu Leu 
355 360 



<210> 352 
<211> 93 
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<212> PRT 

<213> Homo sapiens 
<400> 352 

Met Arg Pro Gin Gly Pro Ala Ala Ser Pre Gin Arg Leu Arg Gly Leu 
1 5 io 15 

Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala Ser Glu 
20 .25 30 

He Pro Lys Gly Lys Gin Lys Ala His Ser Gly Arg Gly Arg Trp Trp 
35 40 45 

Thr Cys lie Met Glu Cys Ala Tyr LysGly Gin Gin Glu Cys Leu Val 
50 55 60 

Glu Thr Gly Ala Leu Gly .Pro Met Ala Phe Arg Val His Leu Gly Ser 
65 70 75 80 

Gin Val Gly Met Asp Ser r.ys Glu Lys Arg Gly Asn Val 
85 go 



<210> 353 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> STTE 
<222> (210) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 353 

Glu Thr Arg Val Lys Thr Ser Leu Glu Lew Leu Arg Thr Gin Leu Glu 
1 - 5 10 15 

Pro Thr Gly Thr Val Gly Asn Thr lie Met Thr Ser Gin Pro Val Pro 
20 25 30 

Asn Glu Thr lie He Val leu Pro Ser Asn Val He Asn Phe Ser Gin 
35 40 45 

Ala Glu Lys Pro Glu Pro Thr Asn Gin Gly Gin Asp Ser Leu Lys Lys 
50 55 60 

Hi3 Leu His Ala Glu He Lys Val rie Gly Thr Zle Gin He Leu Cys 
65 *'U 75 80 

Gly Met Met Val Leu Ser Leu Gly lie He Leu Ala Ser Ala Ser Phe 
85 90 95 

Ser Pro Asn Phe Thr Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr 
100 105 no 

Pro Phe He Gly Pro Phe Phe Phe He lie Ser Gly Ser Leu Ser He 
115 120 125 

Ala Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His S*r Ser Leu Val 
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130 135 140 

Gly Ser lie Leu Ser Ala Leu Ser Ala Leu Val Gly Phe lie lie Leu 
145 150 155 160 

Ser Val Lys G3n Ala Thr Leu Asn Pro Ala Ser Leu Gin Cys Glu Leu 
165 170 175 

Asp Lys Asn Asn lie Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His 
180 185 190 

Asp Ser Leu Tyr Thr Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala 
195 200 205 

Gly Xaa Leu Ser Leu Met Leu lie Cys Thr Leu Leu Glu Phe Cys Leu 
210 215 220 

Ala Val Leu Thr Ala Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe 
225 230 235 240 

Pro Gly Ser Val Leu Phe Leu Pro His Ser Tyr lie Gly Asn Ser Gly 
245 250 255 

Met Ser Ser Lys Met Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr 
260 265 270 



<210> 354 
<211> 192 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (129) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 354 

Met Met Val Leu Ser Leu Gly He He Leu Ala Ser Ala Ser Phe Ser 
15 10 15 

Pro Asn Phe Thr Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro 
20 2S 30 

Phe He Gly Pro Phe Phe Phe He He Ser Gly Ser Leu Ser He Ala 
35 40 45 

Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly 
SO 55 60 

Ser He Leu Ser Ala Leu Ser Ala Leu Val Gly Phe He He Leu Ser 
65 70 75 80 

Val Lys Gin Ala Thr Leu Asn Pro Ala Ser Leu Gin cys Glu Leu Asp 
85 90 95 
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Lys Asn Asn lie Pro Thr Arg Ser 
100 

Ser Leu Tyr Thr Thr Asp Cys Tyr 
115 120 

Xaa Leu Ser Leu Met Leu lie Cys 
130 135 

Val Leu Thr Ala Val Leu Arg Trp 
145 150 

Gly ser Val Leu Phe Leu Pro His 
165 

Ser Ser Lys Met Thr His Asp Cys 

ibo ; 



240 



Tyr Val Ser Tyr Phe Tyr His Asp 
105 HO 

Thr Ala Lys Ala Ser Leu Ala Gly 
125 

Thr Leu Leu Glu Phe Cys Leu Ala 
140 

Lys Gin Ala Tyr Ser Asp Phe Pro 
155 160 

Ser Tyr lie Gly Asn Ser Gly Met 
170 175 

Gly Tyr Clu Glu Leu Leu Thr Ser 
1B5 190 



<210> 355 

<211> 204 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (119) 

<223> Xaa equals any of the naturally occurring L- amino acids 

<400> 355 

Gly Ala Ser Cys Glu Gly Gly Gly Ala Ala Ala Arg Ala Ala Leu Gly 
1 5 10 15 

Val His Arg Ser Gin Lys Ala Leu Leu Val Phe Arg Arg Thr Leu Ser 
20 25 30 

Asn Leu Leu Tyr Met Pro Leu Leu Arg Gly Leu Leu Trp Leu Gin Val 
35 40 45 

Leu Cys Ala Gly Pro Leu His Thr Glu Ala Val Val Leu Leu Val Pro 
50 55 60 

Ser Asp Aop Gly Arg Ala Phe Leu Leu Arg Ser Arg Leu Leu His Pro 
65 70 75 80 

Glu Ala His Val Pro Pro Ala Ala Asp Arg Gly Ala Ser Leu Gin Cys 
85 90 95 

Val Leu His Gin Ala Ala Pro Lys ser Arg Pro Arg Ser Pro Ala Ala 
100 105 110 

Gly Ala Ala Leu Leu His Xaa Pro Arg Arg Thr Gly Asp Glu Pro Cys 
115 120 125 

Arg Glu Phe His Gly Asn Gly Phe Pro Gly Pro Thr Gin Leu Thr Pro 
130 135 140 
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Gly Glu cys Gly Leu Pro Ala Pro Ser Ser Leu Leu Gin His Ala Ser 
145 150 155 160 

Ala Pro Val Arg Thr Gly Ser Glu Gly Gin Val Val Gly Cys Pro Arg 
165 170 175 

Ala Arg Gly Glu Thr Gly Glu Gly Leu Ser Leu Ala Phe Leu Ser Ser 
180 185 190 

Leu Met Phe Thr Ser Arg Asn Gly Leu Val Gly Cys 
195 200 



<210> 356 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<400> 356 

Met Gly Ser Ala Ala Leu Glu lie Leu Gly Leu Val Leu Cys Leu Val 
15 10 15 

Gly Trp Gly Gly Leu lie Leu Ala Cys Gly Leu Pro Met Trp Gin Val 
20 25 30 

Thr Ala Phe Leu Asp His Asn He Val Thr Ala Gin Thr Thr Trp Lys 
35 40 45 

Gly Leu Trp Met Ser Cys Val Val Gin Ser Thr Gly Thr Cys Ser Ala 
50 55 GO 

Lys Cys Thr Thr Arg Cys T^-p t e » 
65 70 



<210> 357 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<40O> 357 

Leu Lys Arg Ala Pro Pro Gly Pro Ala Leu Ala Lys Gly Leu Leu Gin 
1 5 10 15 

Pro Ser Ser Thr Phe Gin Ala Leu Glu Thr Asr. He Gly Asp Gin Veil 
20 25 30 

Arg Arg His Ser Thr Ala Val Val lie Arg Glu Met Thr Ser Tyr He 
35 40 45 

Leu lie Ser Phe Val Leu Leu He Gly Val Gly Cys lie Glu Lys Asp 
50 55 60 

Gin Ser Cys Pro Val Phe Gly Gly Arg Lys Arg Leu His Leu Lea Phe 
65 70 75 80 

Val Gly Gly Gin Leu Arg Gin Val Arg Met Leu Arg Gly Glu Leu Ser 
85 90 95 
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Cys Ala Cys Tyr Arg Pro His Val Gin Ala Leu Gin Leu Gly Gly Cys 
100 105 110 

Thr Cys Phe 
115 



<210> 358 
<211> 88 
<212> PRT 

<213> Homo sapiens 
<400> 358 

Val lie Lys Leu He Cys Pro Ala Ala Phe Pro Val Tyr Phe Gin Asp 
I S 10 15 

Met Ala Arg Gly Cys Val Cys Ser Leu Cys Ala Ser Val Cys lie Phe 
20 25 30 

Leu Ser Ser Leu Phe Pro Leu Leu Pro Ser Val His Ser Val Asn lie 
35 40 45 

lie Ser Cys Leu Leu Leu Ser Lys Cys Phe Glu Gly Leu Glu Leu Met 
50 55 60 

Cys Glu His Leu Tyr Gin Leu ser Gin Leu His Val Leu His His He 
65 70 75 80 

Phe Ser Tyr Leu Leu Cys Thr Pro 
85 



<210> 359 

<?.11> 716 

<212> PRT 

<213> Homo sapiens 

SITE 
(2) 

Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (373) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (705) 

<223> xaa equals any of the naturally occurring L-amino acids 
<400> 359 

Tyr Xaa He Pjto Gly Ser Thr His Ala Ser Gly Arg Gin Arg Glv Ser 
1 5 10 lb 



<220> 
<221* 
<222* 
<223> 



ly Arg Gly Glu Asp Asp Ser Gly Pro Pro Pro Ser Thr Val He Asn 
20 25 30 
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Gin Asn Glu Thr Phe Ala Asn He lie Phe Lys Pro Thr Val Val Glr 
35 4Q 45 

Gin Ala Arg lie Ala Gin Asn Gly lie Leu Gly Asp Phe lie He Aro 
50 55 so 

Tyr Asp Val Asn Arg Glu Gin Ser He Gly Asp He Gin Val Leu Asn 
65 70 75 80 

Gly Tyr Phe Val His Tyr Phe Ala Pro Lys Asp Leu Pro Pro Leu Pro 
85 90 95 

Lys Asn Val Val Phe Val Leu Asp Ser Ser Ala Ser Met Val Gly Thr 
100 105 no 

Lys Leu Arg Gin Thr Lys A^p Ala Leu Phe Thr He Leu His Asp Leu 
115 120 125 

Arg Pro Gin Asp Arg Phe 9er He He Gly Phe Ser Asn Arg lie Lys 
130 135 140 

Val Trp Lys Asp His Leu He Ser Val Thr Pro Asp Ser He Arg Asp 
145 150 155 16 o 

Gly Lys Val Tyr He His His Met Ser Pro Thr Gly Gly Thr Asp He 
165 170 175 

Asn Gly Val Leu Gin Arg Ala He Arg Leu Leu Asn Lys Tyr Val Ala 
1B0 185 190 

His Ser Gly. He Gly Asp Arg Ser Val Ser Leu He Val Phe Leu Thr 
195 200 205 

Asp Gly Lys Pro Thr Val Gly Glu Thr His Thr Leu Lys He Leu Asn 
210 215. 220 

Asn Thr Arg Glu Ala Ala Arg Gly Gin Val Cys lie Phe Thr He Gly 
225 230 235 240 

He Gly Asn Asp Val Asp Phe Arg Leu Leu Glu Lys Leu Ser Leu Glu 
245 250 255 

Asn Cys Gly Leu Thr Arg Arg Val His Glu Glu Glu Asp Ala Gly Ser 
260 265 270 

Gin Leu He Gly Phe Tyr Asp Glu He Arg Thr Pro Leu Leu Se- Asp 
275 280 235 

He Arg He Asp Tyr Pro Pro Ser Ser Val Val Gin Ala Thr Lys Thr 
290 295 300 

Leu Phe Pro Asn Tyr Phe Asn Gly Ser Glu He He He Ala Gly Ly* 
305 315 Y 320 

Leu Val Asp Arg Lys Leu Asp His Leu His val Glu Val Thr Ala Ser 
325 330 335 

Asn Ser Lys Lys Phe He He Leu Lys Thr Asp Val Pro Val Arg Pro 
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340 



34S 



Gin Lys Ala Gly Lys Asp Val Thr Gly Ser Pro Arg Pro Gly Gly Asp 
355 360 365 

Gly Glu Gly Asp Xaa Asn His He Glu Arg Leu Trp Ser Tyr Leu Thr 
370 375 380 

Thr Lys Glu Leu Leu Ser Ser Trp Leu Gin Ser Asp Asp Glu Pro Glu 
385 390 395 400 

Lys Glu Arg Leu Arg Gin Ary Ala Gin Ala Leu Ala Val Ser Tyr Arg 
405 410 415 

Phe Leu Thr Pro Phe Thr Ser Met Lys Leu Arg Gly Pro Val Pro Arg 
42 <> 425 430 

Met Asp Gly Leu Glu Glu Ala His Gly Met Ser Ala Ala Met Gly Pro 
435 440 445 

Glu Pro val Val Gin Ser Val Arg Gly Ala Gly Thr Gin Pro Gly Pro 
450 455 460 

Leu Leu Lys Lys Pro Tyr Gin Pro Arg He Lys He Ser Lys Thr Ser 
465 470 475 480 

Vol Asp Gly Asp Pro His Phe Val val Asp Phe Pro Leu Ser Arg Leu 
485 490 495 

Thr Val Cys Phe Asn Tie Asp Gly Gin Pro Gly Asp He Leu Arg Leu 
500 505 510 

Val ser Aso His Arg Asp Ser Gly Val Thr Val Asn Gly Glu Leu He 
515 520 525 

Gly Ala Pro Ala Pro Pro Asn Gly His Lys Lys Gin Arg Thr Tyr Leu 
530 535 540 

Arg Thr He Thr He Leu He Asn Lys Pro Giu Arg Ser Tyr Leu Glu 
545 550 555 560 

He Thr Pro Ser Arg Val He Leu Asp Gly Gly Asp Arg Leu Val Leu 
565 570 575 

Pro Cys Asn Gin Ser Val Val Val Gly Ser Trp Gly Leu Glu Val Ser 
580 S85 590 

Val ser Ala Asn Ala Asn Val Thr Val Thr He Gin Gly Ser He Ala 
595 600 60S 

Phe Val He Leu He His Leu Tyr Lys Lys Pro Ala Pro Phe Gin Arc 
610 615 620 

His His Leu Gly Phe Tyr He Ala Asn Ser Glu Gly Leu Ser Ser Asn 
625 630 635 640 

Cys His Gly Leu Leu Gly Gin Phe Leu Asn Gin Asp Ala Arg Leu Thr 
645 650 655 
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Glu Asp Pro Ala Gly Pro Ser Gin Asn Leu Thr His Pro Leu Leu Leu 
660 665 670 

Gin Val Gly Glu Gly Pro Glu Ala Val Leu Thr Val Lys Gly His Gin 
675 680 gas 

Val Pro Val Val Trp Lys Gin Arg Lys He Tyr Asn Gly Glu Glu Gin 
690 695 700 

Xaa Asp Cys Trp Phe Ala Arg Asn Met Pro Pro Asn 
705 710 715 

<210> 360 
<211> 387 
<212> PRT 

<213> Homo sapiens 
<400> 360 

Pro Arg Val Arg Ser He Lys Val Thr Glu Leu Lys Gly Leu Ala Asn 
1 5 io 15 

His val Val Val Gly Ser Val Ser Cys Glu Thr Lys Asp Leu Phe Ala 
20 25 30 

Ala Leu Pro Gin Val Val Ala Val Asp lie Asn Asp Leu Gly Thr He 
35 40 45 

Lys Leu Ser Leu Glu Val Thr Trp Ser Pro Phe Asp Lys Asp Asp Gin 
50 55 60 

Pro Ser Ala Ala Ser Ser Val Asn Lys Ala Ser Thx Val Thr Lya Arg 
65 7 <> 75 80 

Phe Ser Thr Tyr Ser Gin Ser Pro Pro Asp Thr Pro Ser Leu Arg Glu 
85 90 95 

Gin Ala Phe Tyr Asn Met Leu Arg Arg Gin Glu Glu Leu Glu Asn Gly 
100 105 1L0 

Thr Ala Trp Ser Leu Ser Ser Glu Ser Ser Asp Asp Ser Ser Ser Pro 
115 120 125 

Gin Leu Ser Gly Thr Ala Arg His Ser Pro Ala Pro Arg Pro Leu Val 
130 135 140 

Gin Gin Pro Glu Pro Leu Pro lie Gin Val Ala Phe Arg Arg Pro Glu 
145 155 160 

Thr Pro ser Ser Gly Pro Leu Asp Glu Glu Gly Ala Val Ala Pro Val 
165 170 17S 

Leu Ala Asn Gly His Ala Pro Tyr Ser Arg Thr Leu Ser His He Ser 
180 185 190 

Glu Ala Ser Val Asn Ala Ala Leu Ala Glu Ala Ser Val Glu Ala Val 
195 200 205 

Gly Pro Lys Ser Leu Ser Trp Gly Pro Ser Pro Pro Thr His Pro Ala 
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210 215 220 

Pro Thr His Gly Lys His Pro Ser Pro Vai Pro Pro Ala Leu Asp Pro 
225 230 235 240 

Gly His Ser Ala Thr Ser Ser Thr Leu Gly Thr Thr Gly Ser Val Pro 
245 250 2S5 

Thr Ser Thr Asp Pro Ala Pro Ser Ala His Leu Asp Ser Val His Lys 
260 265 270 

Ser Thr Asp Ser Gly Pro Ser Glu Leu Pro Gly Pro Thr His Thr Thr 
275 280 28S 

Thr Gly ser Thr Tyr Ser Ala lie Thr Thr Thr His Ser Ala Pro Ser 
290 295 300 

Pro Leu Thr His Thr Thr Thr Gly Ser Thr His Lys Pro lie He Ser 
305 310 315 320 

Thr Leu Thr Thr Thr Gly Pro Thr Leu Asn lie tie Gly Pro Val Gin 
325 330 335 

Thr Thr Thr Ser Pro Thr His Thr Met Pro Ser Pro Ser Ser His Ser 
340 345, 350 

Asn Ser Pro Gin Tyr Val Asp Phe Cys Ser Ser Val Cys Asp Asn lie 
355 360 365 

Phe Val His Tyr Val He Gly He Phe Phe His Thr Leu Tyr Ser Ser 
370 375 3B0 

Lys Thr Leu 
385 



<210> 361 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<400> 361 

Pro Arg Val Arg Ser lie Lys Val Thr Glu Leu Lys Gly Leu Ala Asn 
1 5 10 15 

Kis Val Val Val Gly Ser Val Ser Cys Glu Thr Lys Asp Leu Phe Ala 
20 25 30 

Ala Leu Pro Gin Val Val Ala Val Asp He Asn Asp Leu Gly Thr He 
35 40 45 

Lys Leu Ser Leu Glu Val Thr Trp Ser Pro Phe Asp Lys Asp Asp Gin 
50 55 60 

Pro Ser Ala Ala Ser Ser Val Asn Lys Ala Ser Thr Val Thr Lys Arg 
65 70 75 80 

Phe Ser Thr Tyr Ser Gin Ser Pro Pro Asp Thr Pro Ser Leu Arg Glu 
85 90 95 
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Gin Ala Phe Tyr Asn Met Leu Arg Arg Gin Glu Glu Leu Glu Asn Gly 
100 105 110 

Thr Ala Trp Ser Leu Ser Ser Glu Ser Ser Asp Asp Ser Ser Ser Pro 
115 120 125 

Gin Leu Ser Gly Thr Ala Arg His Ser Pro Ala Pro Arg Pro Leu Val 
130 135 140 

Gin Gin Pro Glu Pro Leu Pro He Gin Val Ala Phe Arg Arg Pro Glu 
145 150 155 160 

Thx Pro Ser Ser Gly Pro Leu Asp Glu Glu Gly Ala Val Ala Pro Val 
165 170 175 

Leu Ala Asn Gly His Ala Fro Tyr Ser Arg Thr Leu Ser His He Ser 
180 185 190 

Glu Ala Ser Val Asn Ala Ala Leu Ala Glu Ala Ser Val Glu Ala Val 
195 200 205 

Gly Pro Lys Ser Leu Ser Trp Gly Pro Ser Pro Pro Thr His Pro Ala 
210 215 220 

Pro Thr His Gly Lys His Pro Ser Pro Val Pro Pro Ala Leu Asp Pro 
225 230 235 240 

Gly His Ser Ala Thr Ser Ser Thr Leu Gly Thr Thr Gly Ser Val Pro 
245 250 255 

Thr Ser Thr Asp 
260 



<210> 362 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<400> 362 

Tyr Gly Cys Glu Lys Thr Thr Glu Gly Gly Arg Arg Arg Arg Arg Arg 
15 10 15 

Met Glu Ala Val Val Phe Val Phe Ser Leu Leu Asp Cys Cys Ala Leu 
20 25 30 

He Phe Leu Ser Val Tyr Phe lie lie Thr Leu Ser Asp Leu Glu Cys 
35 40 45 



Asp Tyr lie Asn Ala Arg Ser Cys 
50 55 

He Pro Glu Leu He Gly His Thr 
65 70 

Ser Leu His Trp Phe He Phe Leu 
85 



Cys Ser Lys Leu Asn Lys Trp Val 
60 

He Val Thr Val Leu Leu Leu Met 

75 80 

Leu Asn Leu Pro Val Ala Thr Trp 

90 95 
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Asn lie Tyr Arg Tyr lie Met Val Pro Ser Gly Asn Met Gly Val Phe 
100 105 110 

Asp Pro Thr Glu lie His Asn Arg Gly Gin Leu Lys Ser His Met Lys 
115 120 125 

Glu Ala Met lie Lys Leu Gly Phe His Leu Leu Cys Phe Phe Met Tyr 
130 135 140 

Leu Tyr Ser Met He Leu Ala Leu lie Asn Asp 
145 150 155 



<210> 363 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Ala Arg Ala Pro Ala Pro Ser Leu Pro Pro Leu Pro Ser Pro Ala Pro 
15 10 15 

Ala Leu Ala Pro Ala His Ser Leu Leu Gly Leu Leu Leu Gly Arg Met 
20 25 30 

Ser Gly Ser Ser Leu Pro Ser Ala Leu Ala Leu Ser Leu Leu Leu Val 
35 40 45 

Ser Gly Ser Leu Leu Pro Gly Pro Gly Ala Ala Gin Asn Val Arg Val 
50 55 60 

Gin Ser Gly Gin Asp Gin 
65 70 



<210> 364 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 364 

Gly Thr Ser Lys Asp Cys Val Leu Tyr Ala Phe Leu Asp Pro Gly Met 
15 10 15 

Ala Val Pro Leu Phe Leu Tyr lie Phe Thr Leu Leu Pro Leu Leu Pro 
20 25 30 

Phe Leu Leu Ser Leu Cys Phe Ser Pro Leu Thr Val Lys Arg Ser Ser 
35 40 45 



Ser Ssr Glu Ser Lys Ser Ser Leu 
50 55 



PCTVUS99/1713Q 



Applicant s or ageni s file 37M1DrT jlniemauonalappliciiuon — 

reference rmmbcr PZ031PCT j Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(FCTRule \3bis) 



A_ The indications mode below relate to the microorganism referred to in the description 
00 Page 280 Jine <W 



REC'O 1 8 AUG 1999 



W1PO pct 



a IDENTIFICATION OF DEPOSIT 



Name of depositary institution American Type Culture Collection 



Further depositsare tdrnti tied on an additional sheet j | 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit 

July 27, 1998 



Accession Number 

203069 



C ADDITIONAL ^>ff>^CATIO^fSffeav < o/ a rt^^ M r a ppfia 1 W^ This information is amunued on an additional sheet Q 



a DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE in&cotiontart -otforMsigtaKdSt**,, 



Europe 

In respect to thoee designations in which a European Patent is sought a sample of the deposited 

«J T W " ! b t r " ade "™ Me publication of the mention of the grant of the European patent 

the date ™^ * 

me issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC) 



E. SEPARATE FURNISHING OF INDICATIONS! ka w blank if not applicable) 



AwT^-r* ^'^ ^ 3UbgUned p lhe toenuateial Bureau la.cr,^^^^^^^ • /4crm r 0 n~ 



Forrecciving Office use only 



This sheet was received with urcimcrnauanal application 



Authorized officer 




Form PCT/RO/134 (July I992) 



For International Bureau use only 



This sheet was recei ved by the International Bureau o 

J8AUGUST 1999 



Authorized officer 



PCT7USW/17130 



CANADA 



The applicant requests that until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement or is withdrawn, the Commissioner of Patents only authorizes the famishing of 
a sample of the deposited biological materia! referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
Iiitemational Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The ^applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by athird party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual cose. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the international Bureau before the completion of the technical preparations for the 
uiternational publication of the application. 



PCT7US99/17130 



DENMARK 



The applicant hereby requests that, until the application has been laid open to public 
.nspect.on (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed byZ 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public undeletions 22 and 33(3) of the Danish Patents Acf If S a 

TZfl u b u ^ i" 6d b ^ U,C app,icant - My ^ aest made ^ a third party for the furnishing of 
a sample shall md.cate the expert to be used. That expert may be any person entered on a list 

ssr tau ^ tteDfflishpto,office "" yp ^ b ^ e ^» 



SWEDEN 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 

k m , ^ ^ th ° Ut haVinB bcen laid ° pcn to public tas P«*i™. *e furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of die PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 



NETHERLANDS 



The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 3 1F(1) of the Patent Rules only by the issue of a sample to 
an expert The request to this effect must be furnished by the applicant with the Netherlands 
Industrial property Office before the date on which the application is made available to the 
public under Sechon 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands 
whichever of the two dates occurs earlier. ^ 



PCT/US99/17130 



Applicant's or agent's file 
reference number 



PZ031PCT 



I International appUcatio; 



Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
<PCTRule litis) 



A. The indications made below rclatcto the microorgantsm referred to in J he description 
on page 243 t i l{tc N/A 



a IDENTtFICATIONOFDEPOSiT 



Further deposits ore identified on an additional sheet | [ 



Name of depositary institution American Type Culture Collection 



Address of depositary tntututicm < including postal cade and country} 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit 

Juno 11, 1098 


Accession Number 

209965 


C ADDrriONALINDrCATIONSfWAtonA^nwcppiicflo/fi This information .s continued on an additional sheet □ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE^iWto^^rfl/irf,;^^, 



Europe ' ' ; ~ 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



E. SEPARATE FURNISHING O F INDICATIONS ( leave bfankif not applicable t 
The t 



N^b^aT^^'T^ bCl ° W ^ SUfamiKCd tP ^ lntcrnational Burcau ^risptcfrte^rutmmiKoithtiiukcan^ "Axtwt 




For International Bureau useoniy 



O This sheet was received by the International Bureau o 



_ Authorized officer 



Form PCT7RQ/134 (July 1992) 



PCT/US99/17130 



CANADA 



The : applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
Internahonal Bureau accordingly before completion of technical preparations for publication 
or the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
reoogmzed experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 



The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Pateuts Regulations). 

FINLAND 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 



PCT/US99/17130 



DENMARK 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is marie 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 



SWEDEN 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only he effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on wluch the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1 ) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands 
whichever of the two dates occurs earlier. 



PCT/US99/1713Q 



Appltcum'ii or agent's file 
reference number 



PZ031PCT 



| International applicauot 



Unassigned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCTRule \Sbts) 



A. The indications made below relate to the mjcroorganismrcfciTcd to in the description 
on page j 249 , ine n/a 



B. IDENTIF1CATIONOFDEPOSIT 



Further deposits are identified on an additional sheet j J 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 



Date of deposit 



June 25, 1098 



Accession Number 



203027 



C. ADDITIONAL INDICATIONS f/tovc olani if not applicable) This information is com i ruied on an tdditionai sheet Q 



a DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE(ifthcindka,i on ^ no ,f 0 ratld a ign a tedStai a ) 



Europe 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



E. S EPA RATE FTJ RNISHING O F INT) I CATIONS (leave blank if not applicable I 



ho^'oIiJw*? 0 * 1 t>ClOW **" submincd to thc in^raational Bureau later (sptafy the gtnerai namrt vfiht indentions eg., 'Acceaicn 



Forreceiving Office use only 



J^^This sheet was recei ved with the international application 
Authorized officer 

FQTl; 

i PCT7RO/I34 (luly 1992) 




For International Bureau use only 



I I This sheet was received by the International Bureau on: 



- Authorized offtc=r 



PCT/US99/17130 



CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 



PCTAJS99tt7130 



DENMARK 

The applicant hereby requests that, until the application has been laid open topublic 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. y 

SWEDEN * 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 1 6 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant' s Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 3 IF(1) of the Patent Rules only by the issue of a sample to 
an expert The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands 
whichever of the two dates occurs earlier. 



PCTAJS99/17130 



Applicant s or agent's file 
reference number 



P2031PCT 



| InTemau on aJ applicant*. 



Unasstgned 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \2bis) 



A. The indications made bciow relate to the microorganism referred to in 
on page 2S3 hnc 


the description 
N/A 




a IDENTTFICATIONOFDEPOSIT 




Further deposits are identified on on additional sheet 


□ 


Naineofdeposttao'insututton American Type Culture Collection 


Address of depositary institution (including postal code and country) 






10801 University Boulevard 
Manassas, Virginia 201 10-2209 
United States of America 








Date of deposit 

Jury 27, 1996 


Accession Number 

203071 


C. ADDITIONAL INDIC ATIO NSffanv blank if oat applicable) 


This information is continuedon anadditional sheet 


n 



D- DESIGNATED STATES FOR WHICH INDICATIONS ARE MADWthem&caiUmsartnmforaltdejignaratSmeit 



Europe 



to respect to those designations in which a European Patent is sought a sample of the Deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, onty by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 26(4) EPC) 



E» SEPARATE FUR NISHING OF INDICATIONS (teaveblaidt if not appitcabU) 
The indications I 



Number of Deposit-}** b **° W ** SUbmitled t0 ^ [ntcnia " orial Bureau later ispeafy the gerxmi rvnwc of ike whcxm^Z^ 



For receiving Office use only 



This sheet was received with the international applicatio 



Authorized officer 




Form PCT/RO/134 (luly 1992) 



For International Bureau use only 



□ 

This sheet was recti ved by the International B urcau on: 



Authorized officer 



PCT/US99/17130 



CANADA 



The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 



NORWAY 



The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent * 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 



AUSTRALIA 



The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 



Trie applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 



PCT/US99/17130 



DENMARK 



The applicant hereby requests thai, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn, up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 16 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 

any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 3lF(l) of the Patent Rules only by the issue of a sample to 
an expert The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands 
whichever of the two dates occurs earlier. 



PCT/US99/17130 



Applicant's or agent's file 
reference number 



PZ031PCT 



tntcmahonaJ appltcatior 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule I3i£r> 



A. The indications made below tdaic to the microorganism referred to in the description 
onpage .line WA 



EL IDENTIFICATION OFDEPOSIT 



Fimher deposits arc identified on an addirional sheet | ' ) 



Name of depositary institution American Type Culture Collection 



Address of depository institution (inciading poUai code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit 



July 27. 1998 



Accession Number 



203070 



C ADDITIONAL INDICATIONS (teste blank ifnoiappiicabUt 



This information is continued on an additional sheet j J 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MAJsEi if the indications are mt for ca designated States) 



Europe 

In respect to those designations in which a European Patent Is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European patent 
or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, only by 
the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4) EPC). 



E. SEPARATE FURNISHING OF INDlCkTlOmOeanbtaakftwtappikable) 



The indications listed below will be submitted to the International Bureau later {specify the qenend a 
Number of Deposit"} 



r of the indications 'Accanon 



For receiving Office use only 



This sheet was received with i 



Auth on zed officer 




itional application 



Fee International Burcauuseonly 



I | This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/IM (July 1992) 



PCT/US99/17130 



CANADA 



1 he applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent ' 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 



AUSTRALIA 



The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open lo 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 



PCT/USW/17130 



DENMARK 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act, If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up^by the Danish Patent Office or any person by the applicant in 
the individual case. ' 



SWEDEN 



The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 1 6 months from the priority 
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert ma 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 



The apphcant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands 
whichever of the two dates occurs earlier. 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCTA/S99/17130 



A. CLASSIFICATION OF SUBJECT MATTER 

tPC(6) :Pteue Seo Extra Shoot 

US CL PI cmc See Extra Sheet. 
According to Intern ftboufll Patent Class* Gcation (IPC) or to both national classification sad IPC 



a FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 536723.1, 23.5; 435/69.1, 320.1, 252J. 325. 6. 7.1; 330/550, 300. 387.1; 514/2 



Documentation tearched other than minimum documentation to me extent that such do cum cols arc included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS. DIALOG - Biotech Files 



C DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



JACOBS, K. A. et al. A Genetic Selection For Isolating cDNAs 
Encoding Secreted Proteins, Gene. 1997, Vol. 198, pages 289- 
296, see entire document. 



Relevant to claim No. 



-23 



□ 

Further documents arc listed in the continuation of Box C. See patent family a 



Spwkt etiogorica of cited ooevmena: 

<*o«umcnl doTinina *« goner*) iuta of Uw in which b not coiutd*ftd 
to be of particular relevance 

Mrtiar doouaeal publi*hod oa or titer 4m krtaraitiorai fillip data 

doawnoat whiefc to my throw doubts on priority claim {»> or w)iki> m 
cited (o «*labtiib <he publication data nf atuwfcer ciuiioo or other 
ipocimJ reuon (*» (pacified} 

retcning to ma oral diacluauro. w, ajOiibittoa «r other 



the priority d 



Uur document ptiblhhul i(W the micmtuortl filing diu or priority 
date and not in conflict with tha •poltcaiioa but cited to widcmtaa 



(be principle or theory utiiarlying tha u 



doctiKtcnt of paiticuUr r«j«*«Dc«; 
contidctwd novel or cannot be cotuidcicd to 
whan, tha doeuMtnt ■> taken eloao 



mvcotian cannot b« 
involve sn inventive tup 



i claimed invention cannot b« 
I to involve an tnventive ttep when tha ^wmmnt b 
with one or raort otticr much document*, such C Ota bin (iion 
being obviout to a percnn Ucillad in tha art 



document com bar of the uioe patent family 



Date of tlic actual completion of tlie in lei national ttcurch 



05 OCTOBER 1999 



Name and matltag address of the ISA/US 
Commissioner of PaUitts and Trademarks 
Box PCT 

Facsimile No. (703) 305-3230 



Date of mailing of the international search report 

21 OCT 1999 



Authorized officer ^/ 

F.I .T^^ RT?ffT^R MM E R ER 
Telephone No. (703) 308-C196 



Form PCTISA^IO (stxond sheetXJuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/17I30 



Box [ Observation* where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This intern abooal report baa not bcco established in reaped of certain claims under A/tide 17(2X») f°' the following 
1. ["I Claims Not.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



nn Claims Nos.: 1-23 

because ihey relate to part* of the international application that do not com ply with the prescribed require meals to such 
•a extent that no meaningful international search can be carried out, specifically: 

Please See Extra Sheet 



3. Q Claims Noa.: . * 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention it lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as tallows: 



1 ' CD a! ' "^'^ additiono * search fees were timely paid by the applicant, this international search report covers all searchable 



claims. 



searchable claim a could be searched without effort justifying an additional fee. this Authority did not invito pavmcnt 
of any additional fee. 

3. Q At only some of the required additional search fees were timely paid by the applicant this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 



4 - No requited additional search tees were timely paid by the applicant. Consequently, this international search report is 

restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | J The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet(l)Xiuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCTAJS99/17130 



A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

C12N 1/21. 5/10. 15/11, 15/12. 15/63; A61K 38/16,38/17; CQ7K 14A00, 14/435. 16/00; GO IN 33/50 

A. CLASSIFICATION OF SUBJECT MATTER: 
USCL : 

536/23.1. 23.5; 435/69.1. 320.1. 2523, 325. 6, 7.1; 530/350. 300. 387.1, 514/2 

BOX I. OBSERVATIONS WHERE CLAIMS WERE FOUND UNSEARCHABLE 
3L When do meaningful search could be carried out, specifically: 

AD of the claim j wera unsearchable to the extent that they require lefefence to sequences finn the sequence listing or ao 
, ATCC deposit However, the specific sequence ind deposit oumbers were replaced in the claims with generic 

designators X, Y and Z. Therefore, no meaningful search of the sequences or deposits par se can be carried out by this 
Authority. The subject matter of the chflms has been searched only to the extent possible with reference to the balance 
of the description. 



Form PCTASA/210 (extra *hcct)(July 1992)* 



